The  contents  of  RAC  publications,  including  the  conclusions, 
represent  the  views  of  RAC  and  should  not  be  considered  to 
have  official  Department  of  the  Army  approval,  either  expressed 
or  implied,  until  reviewed  and  evaluated  by  that  agency  and 
subsequently  endorsed. 


[This page is missing in the original document.]


UTILITY  THEORY 

FOR 

DECISION  MAKING 


PUBLICATIONS  IN  OPERATIONS  RESEARCH 


Operations  Research  Society  of  America 
Editor  for  Publications  to  Operations  Research 
David  B.  Hertz 


No.  1.  Queues,  Inventories  and  Maintenance
Philip  M.  Morse 

No.  2.  Finite  Queuing  Tables 
L.  G.  Peck  and  R.  N.  Hazelwood 

No.  3.  Efficiency  in  Government  through  Systems  Analysis 
Roland  N.  McKean

No.  4.  A  Comprehensive  Bibliography  on  Operations  Research* 
Operations  Research  Group,  Case  Institute 

No.  5.  Progress  in  Operations  Research,  Volume  I 

Edited  by  Russell  L.  Ackoff

No.  6.  Statistical  Management  of  Inventory  Systems*

Harvey  M.  Wagner 

No.  7.  Price,  Output,  and  Inventory  Policy* 

Edwin  S.  Mills 

No.  8.  A  Comprehensive  Bibliography  on  Operations  Research,  1957-1958* 
Operations  Research  Group,  Case  Institute 

No.  9.  Progress  in  Operations  Research,  Volume  II 
David  B.  Hertz  and  Roger  T.  Eddison 

No.  10.  Decision  and  Value  Theory 

Peter  C.  Fishburn

No.  11.  Handbook  of  the  Poisson  Distribution 

Frank  A.  Haight 

No.  12.  Operations  Research  in  Sellers’  Competition:  A 
Stochastic  Microtheory 
S.  Sankar  Sengupta 

No.  13.  Bayesian  Decision  Problems  &  Markov  Chains 
J.  J.  Martin 

No.  14.  Mathematical  Models  of  Arms  Control  and  Disarmament: 
Application  of  Mathematical  Structures  in  Politics 
Thomas  L.  Saaty 

No.  15.  Fourth  International  Conference  on  Operations  Research 

David  B.  Hertz  and  Jacques  Melese 

No.  16.  Progress  in  Operations  Research,  Volume  III 
Relationship  Between  Operations  Research  and  the  Computer 
J.  S.  Aronofsky 

No.  17.  Introduction  to  Systems  Cost  Effectiveness 

Karl  Seiler,  III 

No.  18.  Utility  Theory  for  Decision  Making 
Peter  C.  Fishburn

No.  19.  The  Implementation  of  Operations  Research 

Jan  H.  B.  Huysmans 

No.  20.  The  Challenge  to  Systems  Analysis:  Public  Policy  and  Social  Change 

Edited  by  Grace  J.  Kelleher

No.  21.  Quantitative  Theories  in  Advertising 
A.  G.  Rao 


*  Out-of-print. 




UTILITY  THEORY 

FOR 

DECISION  MAKING 


PETER  C.  FISHBURN 


Research  Analysis  Corporation 


JOHN  WILEY  &  SONS,  INC. 


NEW  YORK • LONDON • SYDNEY • TORONTO


Printed  in  the  United  States  of  America 
10  9  8  7  6  5  4  3  2  1




FOREWORD 


This  book  presents  a  unified  treatment  of  normative 
theories  for  the  evaluation  of  individuals’  preferences  in 
a  variety  of  types  of  decision  situations.  The  material 
was  conceived  and  developed  as  part  of  RAC’s  Advanced
Research  Department  work  program  in  decision  and  value 
theory.  Many  of  the  results  in  the  book  were  developed 
as  a  result  of  basic  research  investigations  under  the 
RAC  Institutional  Research  Program  in  addition  to  ONR 
and  ARO  support. 


NICHOLAS  M.  SMITH 
Head,  Advanced  Research  Department 


PREFACE 


The  underlying  motive  for  this  book  is  the  widespread  activity  of  human 
decision  making.  Its  basic  motif  is  that  decisions  depend,  at  least  in  part,
on  preferences.  Its  subject  matter  is  preference  structures  and  numerical 
representations  of  preference  structures. 

Although  utility  theory  has  well-recognized  roots  that  extend  into  the 
eighteenth  and  nineteenth  centuries,  much  of  its  significant  growth  has 
occurred  in  the  last  two  or  three  decades.  This  growth,  whose  major
contributions  have  come  out  of  economics,  statistics,  mathematics,  psychology,
and  the  management  sciences,  has  been  greatly  stimulated  by  the  use  of 
axiomatic  theory.  This  is  evident,  for  example,  in  the  works  of  Frank  P. 
Ramsey  (1931),  John  von  Neumann  and  Oskar  Morgenstern  (1947),  Leonard 
J.  Savage  (1954),  John  S.  Chipman  (1960),  and  Gerard  Debreu  (1959,  1960), 
all  of  which  use  the  axiomatic  approach.  In  this  approach  the  investigator 
puts  forth  a  set  of  axioms  or  conditions  for  preferences.  It  might  be  said  that 
these  conditions  characterize  a  preference  structure.  Some  of  them  may  be 
viewed  as  criteria  of  consistency  and  coherence  for  the  preferences  of  a 
decision  maker;  others  may  be  viewed  as  structural  and/or  simplifying 
assumptions.  In  any  event,  the  investigator  then  seeks  to  uncover  a  numerical 
model  that  preserves  certain  characteristics  inherent  in  the  assumed  preference 
structure.  Further  investigation  might  indicate  how  such  a  model  can  be 
used  to  help  decision  makers  examine  and  perhaps  resolve  decision  problems. 
This  can  include  methods  of  estimating  the  terms  (utilities,  probabilities) 
that  appear  in  the  model. 

During  1963  through  1969,  while  this  book  progressed  through  its  own 
growth  and  distillation  stages,  I  have  been  increasingly  concerned  by  the
needs  for  a  unifying  upper-level  text  and  a  research-reference  work  on 
utility  theory.  It  is  my  hope  that  the  book  will  satisfy  these  needs  for  at  least 
the  next  several  years. 

The  book  was  written  to  be  self-contained.  My  experience  indicates  that 
many  people  interested  in  utility  theory  are  not  especially  well  trained  in 
mathematics.  For  this  reason  and  to  prevent  any  misunderstanding,  I  have
included  virtually  all  required  background  mathematics.  This  material  is
introduced  when  and  where  it  is  needed.  Those  unfamiliar  with  it  will  of 
course  find  much  of  it  difficult  going,  but  at  least  I  hope  they  will  be  spared 
the  trouble  of  searching  elsewhere  for  it. 

Also  by  way  of  self-containment,  proofs  are  provided  for  all  but  a  very 
few  theorems.  Browsers  will  want  to  skip  the  proofs,  but  they  are  available
when  desired.  In  most  cases,  source  credit  is  given  for  more  involved  proofs. 
In  some  cases  I  have  expanded  others’  proofs  to  make  them  more  accessible 
to  some  readers.  This  is  most  noticeable  with  respect  to  Debreu’s  additivity
theory  in  Chapter  5  and  Savage’s  expected-utility  theory  in  Chapter  14. 

Set  theory  is  the  cornerstone  mathematics  of  the  text.  With  no  significant 
exception,  all  utility  theories  examined  in  the  book  are  based  on  the  theory 
of  binary  relations.  The  main  binary  relation  is  the  preference  relation  “is 
preferred  to.”  Algebra,  group  theory,  topology,  probability  theory,  and  the 
theory  of  mathematical  expectation  arise  at  various  places. 

The  exercises  are  an  integral  part  of  the  book.  Those  with  boldface  numbers 
cover  important  material  not  presented  elsewhere  in  the  chapters.  Other 
exercises  offer  practice  on  the  basic  mathematics  and  on  the  utility  theory 
and  related  materials  discussed  in  the  chapters.  Answers  to  selected  exercises 
follow  Chapter  14.  A  preview  of  the  book’s  contents  is  given  in  the  first 
chapter. 

Finally,  you  should  know  about  two  other  books  that  present  a  significant 
amount  of  material  on  measurement  theory  (of  which  utility  theory  may  be 
considered  a  part)  that  is  not  found  in  this  book.  The  first  of  these  is  John 
Pfanzagl’s  Theory  of  Measurement  (John  Wiley  &  Sons,  Inc.,  New  York, 
1968).  The  second  is  being  prepared  by  David  H.  Krantz,  R.  Duncan  Luce,
Patrick  Suppes,  and  Amos  Tversky. 


McLean,  Virginia
June  1969 


Peter  C.  Fishburn 


ACKNOWLEDGMENTS 


The  preparation  of  this  book  was  made  possible  by  the  joint  support  of  the 
Department  of  the  Army,  DA  Contract  No.  44-188-ARO-1,  and  the  Office
of  Naval  Research,  ONR  Contract  No.  N00014-67-C-0434. 

Since  1963  I  have  been  a  member  of  the  Advanced  Research  Department, 
Research  Analysis  Corporation.  I  am  extremely  grateful  to  Dr.  Nicholas  M. 
Smith,  Jr.,  my  department  head,  and  to  Frank  A.  Parker,  President  of  the 
Research  Analysis  Corporation,  for  their  continued  encouragement  and 
support  of  my  work  during  this  time.  I  am  indebted  also  to  Dr.  George  S. 
Pettee,  Chairman  of  RAC’s  Open  Literature  Committee,  and  to  his
committee,  for  their  guidance.  Mrs.  Elva  Baty  typed  the  first  draft  of  the  book  in
1963,  and  it  was  reviewed  by  Dr.  Irving  H.  Siegel,  Dr.  Richard  M.  Soland, 
Dr.  Jerome  Bracken,  and  Mr.  Robert  Busacker,  all  of  RAC  at  that  time. 
The  second  draft,  completed  in  1968,  was  typed  by  Mrs.  Virginia  M.  Johnson, 
who  also  typed  the  final  draft.  I  offer  my  sincerest  thanks  to  these  friends 
and  many  others  at  RAC  who  have  participated  in  the  preparation  of  the
work. 

Dr.  David  B.  Hertz,  Editor  of  this  series  for  ORSA,  personally  read  the 
material  for  this  book  as  it  was  developed.  I  am  grateful  to  him  for  his 
continued  encouragement  and  counsel,  and  to  the  Publications  Committee
of  ORSA  for  its  decision  to  publish. 

The  original  idea  for  this  book  came  from  Professor  Russell  L.  Ackoff. 
Another  great  teacher,  Professor  Leonard  J.  Savage,  helped  me  to
understand  his  own  utility  theory  and  was  instrumental  in  developing  the  contents
of  Chapter  10.  He  made  many  useful  suggestions  on  the  material  in  Chapters 
11  and  13  also.  Encouragement  and  help  on  the  first  part  of  the  book  came 
from  Professors  Gerard  Debreu,  David  H.  Krantz,  R.  Duncan  Luce,  and 
Marcel  K.  Richter. 

I  am  indebted  also  to  many  editors  and  referees.  In  addition  to  sources 
acknowledged  in  the  text,  I  would  like  to  thank  those  who  have  helped  me 
with  articles  whose  contents  appear  in  part  in  Chapters  9  through  13.  A  few 
items  in  the  exercises  of  Chapter  9  appeared  in  “Semiorders  and  Risky
Choices,”  Journal  of  Mathematical  Psychology  5  (1968),  358-361.  Chapter  10
grew  out  of  “Bounded  Expected  Utility,”  The  Annals  of  Mathematical 
Statistics  38  (1967),  1054-1060.  Chapter  11  is  based  on  “Independence  in 
Utility  Theory  with  Whole  Product  Sets,”  Operations  Research  13  (1965), 
28-45;  “Markovian  Dependence  in  Utility  Theory  with  Whole  Product 
Sets,”  Operations  Research  13  (1965),  238-257;  “Independence,  Trade-Offs, 
and  Transformations  in  Bivariate  Utility  Functions,”  Management  Science 
11  (1965),  792-801;  “Stationary  Value  Mechanisms  and  Expected  Utility 
Theory,”  Journal  of  Mathematical  Psychology  3  (1966),  434-457;  “Conjoint 
Measurement  in  Utility  Theory  with  Incomplete  Product  Sets,”  Journal  of 
Mathematical  Psychology  4  (1967),  104-119;  “Additive  Utilities  with
Incomplete  Product  Sets:  Application  to  Priorities  and  Assignments,”
Operations  Research  15  (1967),  537-542;  “Interdependence  and  Additivity  in
Multivariate,  Unidimensional  Expected  Utility  Theory,”  International 
Economic  Review  8  (1967),  335-342;  and  “A  Study  of  Independence  in 
Multivariate  Utility  Theory,”  Econometrica  37  (1969),  107-121.  Some  material
in  Chapter  12  appeared  in  “An  Abbreviated  States  of  the  World  Decision 
Model,”  IEEE  Transactions  on  Systems  Science  and  Cybernetics  4  (1968), 
300-306.  Chapter  13  is  based  on  “Preference-Based  Definitions  of  Subjective 
Probability,”  Annals  of  Mathematical  Statistics  38  (1967),  1605-1617,  and 
“A  General  Theory  of  Subjective  Probabilities  and  Expected  Utilities,” 
Annals  of  Mathematical  Statistics  40  (1969),  1419-1429.  In  addition  to  these, 
the  material  on  interval  orders  in  Section  2.4  is  based  on  “Intransitive 
Indifference  with  Unequal  Indifference  Intervals,”  Journal  of  Mathematical 
Psychology  7  (1970),  and  many  of  the  results  for  strict  partial  orders 
throughout  the  book  are  summarized  in  “Intransitive  Indifference  in 
Preference  Theory:  A  Survey,”  Operations  Research  18  (1970). 

Finally  and  foremost,  I  thank  my  wife  Janet  and  our  children  for  their 
love,  and  Rebecca  and  Hummel,  who  made  it  possible. 


P.C.F. 


CONTENTS 


1  Introduction and Preview
   1.1  General Organization
   1.2  Part I: Utilities without Probabilities
   1.3  Parts II and III: Utilities with Probabilities

PART I  UTILITIES WITHOUT PROBABILITIES

2  Preference Orders and Utility Functions for Countable Sets
   2.1  Binary Relations
   2.2  Preference as a Weak Order
   2.3  Preference as a Strict Partial Order
   2.4  Ordered Indifference Intervals
   2.5  Summary

3  Utility Theory for Uncountable Sets
   3.1  The Denseness Axiom and Weak Orders
   3.2  Preference as a Strict Partial Order
   3.3  Preferences on $\mathrm{Re}^n$
   3.4  Continuous Utilities
   3.5  Summary

4  Additive Utilities with Finite Sets
   4.1  Preference Independence among Factors
   4.2  Theorem of The Alternative
   4.3  Lexicographic Utilities
   4.4  Summary

5  Additive Utilities with Infinite Sets
   5.1  Strictly Ordered Groups
   5.2  Algebraic Theory for n Factors
   5.3  Topological Preliminaries
   5.4  Topological Theory for n Factors
   5.5  Summary

6  Comparison of Preference Differences
   6.1  “Measurable” Utility
   6.2  Theory with Finite Sets
   6.3  Review of Infinite-Set Theories
   6.4  Summary

7  Preferences on Homogeneous Product Sets
   7.1  Persistence and Impatience
   7.2  Persistent Preference Differences
   7.3  Constant Discount Rates
   7.4  Summary

PART II  EXPECTED-UTILITY THEORY

8  Expected Utility with Simple Probability Measures
   8.1  Example
   8.2  Simple Probability Measures
   8.3  Expected Utility for Simple Measures
   8.4  Mixture Sets
   8.5  Summary

9  Expected Utility for Strict Partial Orders
   9.1  An Expected Utility Theorem
   9.2  Convex Sets and Cones
   9.3  Proof of Theorem 9.1
   9.4  Summary

10  Expected Utility for Probability Measures
   10.1  Two Examples
   10.2  Probability Measures
   10.3  Expectations
   10.4  Preference Axioms and Bounded Utilities
   10.5  Theorems
   10.6  Proofs of Theorems 10.1, 10.3, and 10.5
   10.7  Summary

11  Additive Expected Utility
   11.1  Expected Utility with $X = \prod X_i$
   11.2  Additive Expectations with $X \subseteq \prod X_i$
   11.3  Additive, Interdependent Expectations for $\prod X_i$
   11.4  Probability Measures on Homogeneous Product Sets
   11.5  Summary

PART III  STATES OF THE WORLD

12  States of the World
   12.1  States and States
   12.2  Expected Utility Preview
   12.3  Models without State Probabilities
   12.4  Summary

13  Axioms with Extraneous Probabilities
   13.1  Horse Lotteries
   13.2  Finite States Theory
   13.3  Homogeneous Horse Lottery Theory
   13.4  The Part II Decision Model
   13.5  Summary

14  Savage’s Expected-Utility Theory
   14.1  Savage’s Expected-Utility Theorem
   14.2  Axioms for Probability
   14.3  Probabilities from Preferences
   14.4  Utility for Simple Acts
   14.5  Utilities Are Bounded
   14.6  Utility for All Acts
   14.7  Summary

Answers to Selected Exercises

References

Author Index

Subject Index



Chapter  1 

INTRODUCTION  AND  PREVIEW 


Decision  making  serves  as  the  foundation  on  which  utility  theory  rests.  For 
the  purposes  of  this  book  we  envision  a  decision  maker  who  must  select  one 
alternative  (act,  course  of  action,  strategy)  from  a  recognized  set  of  decision
alternatives.  Our  study  will  focus  on  individuals’  preferences  in  such  decision 
situations.  For  a  connection  between  decision  and  preference  we  shall  assume 
that  preferences,  to  a  greater  or  lesser  extent,  govern  decisions  and  that, 
generally  speaking,  a  decision  maker  would  rather  implement  a  more
preferred  alternative  than  one  that  is  less  preferred.

In  the  axiomatic  systems  examined  in  this  book,  an  individual’s  preference 
relation  on  a  set  of  alternatives  enters  as  a  primitive  or  basic  notion.  This 
means  that  we  shall  not  attempt  to  define  preference  in  terms  of  other 
concepts.  We  shall,  however,  suggest  that,  by  self-interrogation,  an  individual 
can  identify  at  least  some  of  his  preferences. 

As  we  proceed  through  various  types  of  decision  situations  it  will  become 
apparent  that,  under  specified  assumptions,  preferences  between  decision 
alternatives  might  be  characterized  in  terms  of  several  factors  relating  to  the 
alternatives.  In  cases  where  alternatives  can  be  viewed  as  aggregates  of  several
attributes  or  factors,  holistic  preferences  might  be  represented  as  aggregates 
of  preferences  on  the  several  factors.  In  other  cases,  as  in  decision  under 
uncertainty,  holistic  preferences  may  be  represented  in  terms  of  utilities  for 
consequences  and  probabilities  for  consequences  or  for  “states  of  the  world.” 
These  special  ways  of  representing  preferences  do  not  of  course  explain  the 
meaning  of  the  term  although  they  may  help  in  understanding  how  holistic 
preferences  can  be  described  in  terms  of  other  factors. 

1.1  GENERAL  ORGANIZATION 

The  three  main  parts  of  the  text  comprise  two  main  divisions  of  our  subject 
as  follows: 

Part  I.  Individual  decision  under  certainty. 

Parts  II  and  III.  Individual  decision  under  uncertainty. 




Part  I,  titled  “Utilities  without  Probabilities,”  covers  situations  where 
uncertainty  is  not  explicitly  formulated.  I  use  the  phrase  “decision  under 
certainty”  as  an  abbreviation  for  something  like  “decision  making  in  which 
uncertainty,  whatever  form  it  might  take,  is  suppressed  and  not  given  explicit 
recognition.” 

Parts  II  and  III  explicitly  recognize  the  form  of  uncertainty  that  is
characterized  by  the  question:  If  I  implement  decision  alternative  f,  then  what  will
happen?  Parts  II  and  III  differ  in  their  formulations  of  an  uncertain  situation,
although  under  appropriate  interpretation  the  two  formulations  are
equivalent.  In  Part  II,  titled  “Expected-Utility  Theory,”  the  uncertainty  is  expressed
in  terms  of  the  probability  that  consequence  x  will  result  if  act  f  is
implemented.  In  Part  III,  “States  of  the  World,”  uncertainty  is  expressed  in  terms
of  probabilities  for  contingencies  whose  occurrence  cannot  be  influenced  by 
the  specific  act  that  is  implemented  but  which  determine  the  consequence  that 
results  under  each  available  act.  The  Part  II  formulation  is  the  one  used  in 
Fishburn  (1964).  The  Part  III  formulation  is  the  one  adopted  in  the  version 
of  statistical  decision  theory  sponsored  by  Savage  (1954)  and  Raiffa  and 
Schlaifer  (1961). 
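The link between the two formulations can be sketched in a few lines of Python. The states, their probabilities, and the acts below are hypothetical and chosen only for illustration. The sketch shows how state probabilities that do not depend on the act (the Part III view) induce, for each act, a probability distribution over consequences (the Part II view).

```python
from collections import defaultdict
from fractions import Fraction

# Hypothetical states of the world; their probabilities do not depend on the act.
state_prob = {"rain": Fraction(1, 4), "cloudy": Fraction(1, 4), "sun": Fraction(1, 2)}

# Each act assigns one consequence to every state (all names are illustrative).
acts = {
    "carry umbrella": {"rain": "dry", "cloudy": "encumbered", "sun": "encumbered"},
    "leave umbrella": {"rain": "wet", "cloudy": "free", "sun": "free"},
}

def consequence_distribution(act):
    """Probability that each consequence results if the given act is implemented."""
    dist = defaultdict(Fraction)
    for state, p in state_prob.items():
        dist[act[state]] += p  # states leading to the same consequence pool their mass
    return dict(dist)

induced = {name: consequence_distribution(act) for name, act in acts.items()}
```

Under these assumptions, `induced["carry umbrella"]` assigns probability 1/4 to `"dry"` and 3/4 to `"encumbered"`, which is the kind of act-dependent consequence probability that Part II takes as given.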

In  the  actual  presentations  of  Parts  II  and  III  there  is  another  noticeable 
difference.  In  Part  III,  especially  Chapters  13  and  14,  the  state  probabilities 
as  well  as  the  utilities  are  derived  from  the  preference  axioms.  In  Part  II 
probabilities  of  acts  for  consequences  are,  so  to  speak,  taken  as  given  and 
enter  into  the  axioms.  This  is  partly  rectified  in  Section  13.4,  which  presents 
an  axiomatization  for  the  Part  II  formulation  in  which  the  consequence 
probabilities  are  derived  from  the  axioms.  An  alternative  axiomatization  of 
the  Part  II  model  that  also  does  not  use  consequence  probabilities  in  the 
axioms  has  been  developed  recently  by  Duncan  Luce  and  David  ICrantz. 
Since  this  awaits  publication  as  I  am  completing  this  book,  its  important 
contributions  do  not  appear  here. 

1.2  PART  I:  UTILITIES  WITHOUT  PROBABILITIES

A  natural  first  topic  for  a  study  on  utility  theory  is  the  elementary
properties  of  a  preference  relation  on  a  set  of  decision  alternatives.  The  next  two
chapters  go  into  this  in  some  detail.  Their  main  concern  is  what  might  be 
called  the  fundamental  theorem  of  utility.  This  has  to  do  with  axioms  for 
preferences  which  guarantee,  in  a  formal  mathematical  sense,  the  ability  to 
assign  a  number  (utility)  to  each  alternative  so  that,  for  any  two  alternatives, 
one  is  preferred  to  the  other  if  and  only  if  the  utility  of  the  first  is  greater  than 
the  utility  of  the  second. 

These  two  chapters  differ  primarily  in  the  size  assumed  for  the  set  of 
alternatives.  Chapter  2  assumes  that  this  set  is  finite  or  denumerably  infinite;
Chapter  3  covers  cases  where  the  alternative  set  is  so  large  that  it  is
uncountable  (neither  finite  nor  denumerable).  After  dealing  with  the  fundamental
theorem,  Chapter  2  discusses  ordering  properties  on  preferences  that  are  not 
strong  enough  to  yield  the  fundamental  theorem.  Here  we  shall  not  assume 
that  indifference  (“no  preference”)  is  transitive.  Along  with  the  fundamental 
theorem  as  such,  Chapter  3  gives  sufficient  conditions  for  order-preserving 
utilities  when  the  alternative  set  is  a  subset  of  finite-dimensional  Euclidean 
space,  and  then  goes  on  to  consider  continuous  utility  functions. 

Additive  Utilities 

Chapters  4,  5,  and  7  deal  with  cases  where  each  alternative  can  be  viewed
as  a  multiple-factor  or  multiple-attribute  entity.  In  more  mathematical  terms,
each  alternative  is  an  n-tuple  of  elements,  one  element  from  each  of  a  set  of  n
factors.  Unlike  the  other  chapters  in  this  trio,  Chapter  7  deals  explicitly  with
the  case  where  the  n  factors  are  essentially  similar.  A  prototype  example  for
Chapter  7  is  the  case  where  n  denotes  a  number  of  time  periods  and  an
alternative  specifies  income  in  each  period.  Time-oriented  notions  of
persistent  preferences,  impatience,  stationarity,  and  marginal  consistency  are
examined  in  Chapter  7,  as  well  as  a  persistent  preference  difference  concept
that  draws  on  material  in  Chapter  6.

Chapters  4  and  5  deal  with  preference  conditions  on  a  set  of  multiple-factor
alternatives  that  not  only  yield  order-preserving  utilities  as  in  Chapters  2  and
3  but  also  enable  the  utility  of  each  alternative  to  be  written  as  the  sum  of 
utility  numbers  assigned  to  each  of  the  n  components  of  the  alternative.  In 
simpler  language,  these  chapters  deal  with  conditions  that  imply  that  the 
utility  of  a  whole  can  be  expressed  as  the  sum  of  utilities  of  its  parts.  In 
Chapter  4  the  alternative  set  is  taken  to  be  finite;  in  Chapter  5  the  number  of 
alternatives  is  infinite. 
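As a minimal sketch of what an additive representation looks like, the Python fragment below builds the utility of each two-factor alternative as the sum of factor utilities and ranks the alternatives accordingly. The factor levels and the numbers attached to them are hypothetical; the sketch illustrates the form of the representation, not the conditions under which it exists.

```python
from itertools import product

# Hypothetical factor utilities u_1, u_2 for n = 2 factors (illustrative numbers only).
factor_utils = [
    {"small": 0.0, "large": 2.0},        # factor 1
    {"cheap": 1.5, "expensive": 0.0},    # factor 2
]

def additive_utility(alternative):
    """u(x_1, ..., x_n) = u_1(x_1) + ... + u_n(x_n)."""
    return sum(u_i[x_i] for u_i, x_i in zip(factor_utils, alternative))

# Every alternative is an n-tuple with one element from each factor.
alternatives = list(product(*(u_i.keys() for u_i in factor_utils)))

# Order the alternatives from least to most preferred under the additive utility.
ranking = sorted(alternatives, key=additive_utility)
```

With these assumed numbers the ranking runs from `("small", "expensive")` up to `("large", "cheap")`.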

Strength  of  Preference 

Chapter  6  is  the  only  chapter  in  the  book  that  deals  primarily  with  utility 
concepts  involving  strength  of  preference  or  preference  intensity.  It  is
concerned  with  comparisons  between  pairs  of  alternatives  and  raises  the
question:  Is  your  difference  in  preference  (degree  of  preference)  between  these
two  alternatives  less  than,  equal  to,  or  greater  than  your  difference  in 
preference  between  those  two  alternatives?  Chapter  6  is  concerned  with 
utility  functions  that  preserve  such  preference-difference  comparisons. 
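The following Python sketch, built on hypothetical utility numbers, illustrates why preserving preference-difference comparisons asks more of a utility function than preserving order alone. It assumes, for illustration, that the difference in preference between x and y is measured by u(x) - u(y). Squaring the utilities keeps the ordering of the alternatives but reverses one difference comparison.

```python
def difference_less(util, x, y, z, w):
    """True when util[x] - util[y] is smaller than util[z] - util[w]."""
    return util[x] - util[y] < util[z] - util[w]

# Hypothetical utilities on four alternatives (illustrative numbers only).
u = {"a": 0.0, "b": 1.0, "c": 3.0, "d": 3.5}

# An increasing transformation of u (squaring nonnegative numbers).
v = {name: value ** 2 for name, value in u.items()}

# Both functions order the alternatives in the same way.
same_order = sorted(u, key=u.get) == sorted(v, key=v.get)

# They disagree on whether the d-versus-c difference is smaller than the b-versus-a difference.
u_says = difference_less(u, "d", "c", "b", "a")
v_says = difference_less(v, "d", "c", "b", "a")
```

Here `same_order` is true while `u_says` and `v_says` differ, so only one of the two functions can agree with a given set of difference judgments.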

1.3  PARTS  II  AND  III:  UTILITIES  WITH  PROBABILITIES

As  noted  above,  Parts  II  and  III  differ  in  their  formulations  of  decision  under 
uncertainty.  Both  parts  are  concerned  with  simple  preference  comparisons
between  alternatives  whose  consequences  are  uncertain,  and  with
preference  conditions  that  not  only  yield  order-preserving  utilities  for  the
alternatives  but  also  enable  the  utility  of  an  alternative  to  be  written  as  a 
mathematical  expectation  involving  consequence  utilities  and  consequence 
probabilities. 

In  this  book,  probability  is  interpreted  in  a  subjective  or  personal  way. 
Roughly  speaking,  a  probability  is  a  numerical  expression  of  the  confidence 
that  a  particular  person  has  in  the  truth  of  a  particular  proposition,  such  as 
the  proposition  “if  I  implement  f  then  consequence  x  will  result,”  or  the
proposition  “this  coin  will  land  ‘heads’  on  the  next  flip.”  Such  probabilities 
are  required  to  obey  well-defined  rules  of  coherence  and  consistency.  In  those 
cases  where  probabilities  are  derived  from  preference  axioms,  the  primitive 
notion  for  probability  is  preference.  Early  in  Chapter  14  we  shall  see  how 
probability  can  be  axiomatized  in  terms  of  a  relation  “is  less  probable  than” 
on  a  set  of  propositions  or  events.  Later  in  Chapter  14  we  shall  see  how  “is 
less  probable  than”  can  be  defined  in  terms  of  “is  preferred  to.”  My  own 
viewpoint  on  probability  is  heavily  influenced  by  de  Finetti  (1937)  and 
Savage  (1954).  Kyburg  and  Smokler  (1964)  is  recommended  for  further
introductory  reading  in  subjective  probability.  Chapter  5  of  Fishburn  (1964) 
discusses  other  interpretations  of  the  meaning  of  probability. 

Part  II 

The  first  three  chapters  of  Part  II  derive  the  expected-utility  representation 
for  alternatives  with  uncertain  consequences.  In  these  chapters  the
consequence  probabilities  are  taken  as  “givens”  so  that  the  alternatives  in  the
preference  axioms  are  probability  distributions  or  measures  on  a  set  of 
consequences.  Chapter  8  concentrates  on  simple  probability  measures,  where 
each  alternative  has  probability  one  (certainty)  of  resulting  in  a  consequence 
from  some  finite  subset  of  consequences.  Chapter  9  considers  simple  measures 
also  but,  unlike  Chapter  8,  it  does  not  assume  that  indifference  is  transitive. 
Chapter  10  admits  more  general  probability  measures  on  the  consequences. 
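A small Python sketch may help fix the idea of a simple probability measure and of the expected-utility form. The consequences, their utilities, and the two measures below are hypothetical. The check `is_simple_measure` reflects only the informal description given here (finitely many consequences carrying total probability one), not the formal definition of Chapter 8.

```python
from fractions import Fraction

# Hypothetical consequence utilities (illustrative numbers only).
utility = {"win": Fraction(10), "draw": Fraction(4), "lose": Fraction(0)}

def is_simple_measure(p):
    """Nonnegative mass on finitely many consequences, with total mass one."""
    return all(mass >= 0 for mass in p.values()) and sum(p.values()) == 1

def expected_utility(p):
    """Sum over consequences x of P(x) * u(x)."""
    return sum(mass * utility[x] for x, mass in p.items())

# Two alternatives, each described by a measure on the consequences.
risky = {"win": Fraction(1, 2), "lose": Fraction(1, 2)}
safe = {"draw": Fraction(1)}

# The alternative with the larger expected utility is the more preferred one.
risky_preferred = expected_utility(safe) < expected_utility(risky)
```

Under these assumed numbers the risky alternative has expected utility 5 and the safe one 4, so `risky_preferred` is true.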

Uncertainty  is  combined  with  multiple-factor  consequences  in  Chapter  11. 
This  chapter  identifies  conditions  that  enable  the  expected  utility  of  an 
uncertain  alternative  with  n-tuple  consequences  to  be  expressed  as  the  sum 
of  expected  utilities  for  each  of  the  n  factors.  Section  11.4,  like  Chapter  7, 
examines  the  case  where  the  n  factors  are  essentially  similar. 

Part  III

The  three  chapters  in  Part  III  deal  with  the  basic  states  of  the  world  decision 
formulation.  Chapter  12  introduces  this  formulation,  demonstrates  its
equivalence  to  the  Part  II  formulation,  and  considers  some  axioms  that  do  not
yield  the  complete  expected-utility  subjective-probability  representation. 


[Illegible in source] probabilities  are  used  in  the  axioms  of  this
chapter,  but  they  are  not  the  state  probabilities.  The  latter  are  derived  from
the  axioms.  [Remainder of paragraph illegible in source]

PART  I

UTILITIES  WITHOUT  PROBABILITIES


With  few  exceptions,  most  of  the  significant  developments  in  individual 
utility  theory  for  preference  structures  that  do  not  explicitly  incorporate 
uncertainty  or  probability  have  occurred  since  the  beginning  of  the  twentieth 
century.  Economists  and  mathematical  economists  are  largely,  though  not 
exclusively,  responsible  for  these  developments.  The  basic  theory  (Chapters 
2  and  3)  deals  with  the  existence  of  utility  functions  on  a  set  of  alternatives 
that  preserve  the  ordering  of  the  alternatives  based  on  an  individual’s
preference  relation,  and  with  special  properties — such  as  continuity — of 
utility  functions.  A  secondary  basic  development  (Chapter  6)  centers  on  a 
strength-of-preference  concept  that  concerns  comparisons  of  preference 
differences. 

Although  the  assumption  of  additive  utilities  for  multiple-factor  situations 
(Chapters  4  and  5)  was  widely  used  by  economists  in  the  mid-nineteenth 
century,  it  was  discarded  by  many  toward  the  end  of  the  century.  In  more 
recent  years,  principally  since  1959,  axiomatic  theories  for  additivity  have 
been  developed.  These  theories  show  what  must  be  assumed  about  preferences 
so  that  the  order-preserving  utility  functions  can  be  written  as  combinations 
of  utility  functions  for  the  several  factors. 




Chapter  2 


PREFERENCE  ORDERS  AND  UTILITY 
FUNCTIONS  FOR  COUNTABLE  SETS 


Throughout  the  book  we  shall  let  X  denote  a  set  whose  elements  are  to  be 
evaluated  in  terms  of  preference  in  a  particular  decision  situation.  Depending 
on  the  context,  the  elements  in  X  might  be  called  alternatives,  consequences, 
commodity  bundles,  cash  flows,  systems,  allocations,  inventory  policies, 
strategies,  and  so  forth.  This  chapter  is  primarily  but  not  exclusively
concerned  with  cases  where  X  is  a  countable  set,  which  means  that  X  is  finite  or
denumerable.  A  set  is  denumerable  if  and  only  if  its  elements  can  be  placed  in
one-to-one  correspondence  with  the  elements  in  the  set  $\{1, 2, 3, \ldots\}$  of
positive  integers.  The  set  $\{\ldots, -2, -1, 0, 1, 2, \ldots\}$  of  all  integers  and  the
set  of  rational  numbers  (expressible  as  ratios  of  integers)  are  denumerable.
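To make the claim about the integers concrete, here is one possible one-to-one correspondence between the positive integers and all integers, written as a short Python sketch. The particular pairing (0, 1, -1, 2, -2, ...) and the function names are choices made for this illustration; many other correspondences would serve equally well.

```python
from itertools import count, islice

def nth_integer(n):
    """Send the positive integer n to an integer: 1 -> 0, 2 -> 1, 3 -> -1, 4 -> 2, ..."""
    return n // 2 if n % 2 == 0 else -(n // 2)

def index_of(k):
    """Inverse map: the position of the integer k in the listing above."""
    return 2 * k if k > 0 else 1 - 2 * k

# The first few integers in the order in which the correspondence lists them.
first_seven = [nth_integer(n) for n in islice(count(1), 7)]
```

Because `index_of` undoes `nth_integer`, every integer appears exactly once in the listing, which is what one-to-one correspondence requires.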

Throughout the book we shall take strict preference ≺ as the basic binary
relation on X or on a set based on X, and indifference ~ will be defined as
the absence of strict preference. One could also begin with a
preference-indifference relation ≼ (read x ≼ y as x is not preferred to y),
but I have come to prefer ≺ for several technical reasons plus the fact that
we tend to think in terms of preference rather than preference-indifference.

The first main result of this chapter is that, when X is countable, numbers
u(x), u(y), ... can be assigned to the elements x, y, ... in X in such a way
that

x ≺ y <=> u(x) < u(y)

holds if ≺ on X is a weak order (Definition 2.1). The <=> means “if and only
if” and its companion => means “implies.” A second main result says that
there is a real-valued function u on X such that

x ≺ y => u(x) < u(y)

when ≺ on X is a strict partial order (Definition 2.2), provided that X is
countable. Several other utility-representation theorems are presented later
in the chapter.




Preference  Orders  for  Countable  Sets 


2.1  BINARY  RELATIONS 


The entire book is based on binary relations. A binary relation on a set Y
is a set of ordered pairs (x, y) with x ∈ Y and y ∈ Y. The x ∈ Y means that x
is an element in Y; we often abbreviate x ∈ Y, y ∈ Y by writing x, y ∈ Y.

The universal binary relation on Y is the set {(x, y) | x, y ∈ Y} of all ordered
pairs from Y. In general {x | S} is the set of all elements x that satisfy the
conditions specified by S. If R is a binary relation on Y then it is a subset of
the universal binary relation. In general, A ⊆ B (A is a subset of B) means
that every element in A is in B also.

We often write xRy to mean that (x, y) ∈ R. Similarly, not xRy (it is false
that x stands in the relation R to y) means that (x, y) ∉ R. In general a ∉ A
means that a is not an element in A. If R is a binary relation on Y then for
each (x, y) in the universal relation either xRy or not xRy, and not both.

Because we are dealing with ordered pairs, (x, y) is not the same as (y, x)
unless x = y. Hence, if R is a binary relation on Y and if x, y ∈ Y, then
exactly one of the following four cases holds:

1.  (xRy,  yRx), 

2.  (xRy,  not  yRx), 

3.  (not  xRy,  yRx), 

4.  (not  xRy,  not  yRx). 

Let Y be the set of all living people. Let R_1 mean “is shorter than,” so that
xR_1y means that x is shorter than y. Case (1) is impossible. Case (2) holds
when x is shorter than y. Case (4) holds when x and y are of equal height. R_1
is an example of a weak order.

Next, let R_2 be “is the brother of” (by having at least one parent in
common). Here cases (2) and (3) are impossible. R_2 is not transitive since if
xR_2y and yR_2z it does not necessarily follow that xR_2z. (Why?)


Some  Relation  Properties 

The binary relations we use will be assumed to have certain properties. A
list of some of these follows. A binary relation R on a set Y is

p1. reflexive if xRx for every x ∈ Y,

p2. irreflexive if not xRx for every x ∈ Y,

p3. symmetric if xRy => yRx, for every x, y ∈ Y,

p4. asymmetric if xRy => not yRx, for every x, y ∈ Y,

p5. antisymmetric if (xRy, yRx) => x = y, for every x, y ∈ Y,

p6. transitive if (xRy, yRz) => xRz, for every x, y, z ∈ Y,

p7. negatively transitive if (not xRy, not yRz) => not xRz, for every
x, y, z ∈ Y,

p8. connected or complete if xRy or yRx (possibly both) for every x, y ∈ Y,

p9. weakly connected if x ≠ y => (xRy or yRx) throughout Y.

Several  other  properties  are  introduced  in  Section  2.4. 

An asymmetric binary relation is irreflexive. An irreflexive and transitive
binary relation is asymmetric: if (xRy, yRx) then p6 gives xRx, which violates
p2. It is also useful to note that R is negatively transitive if and only if,
for all x, y, z ∈ Y,

xRy => (xRz or zRy). (2.1)

To prove this suppose first that, in violation of (2.1), (xRy, not xRz, not
zRy). Then, if the p7 condition holds, we get not xRy, which contradicts
xRy. Hence the p7 condition implies (2.1). On the other hand, suppose the
p7 condition fails with (not xRy, not yRz, xRz). Then (2.1) must be false,
since xRz would require xRy or yRz. Hence (2.1) implies the p7 condition.

The relation R_1 (shorter than) is irreflexive, asymmetric, transitive, and
negatively transitive. If no two people are of equal height, R_1 is weakly
connected. R_2 (brother of) is symmetric.
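
For a finite set the properties above can be checked mechanically. The following is a minimal sketch, not part of the original text: a relation is represented as a Python set of ordered pairs, and the names, heights, and function names are hypothetical illustrations. It tests several of p1 to p9 for a "shorter than" relation and confirms, on this one example, that p7 and (2.1) agree.

```python
from itertools import product

def is_irreflexive(R, Y):          # p2
    return all((x, x) not in R for x in Y)

def is_asymmetric(R):              # p4
    return all((y, x) not in R for (x, y) in R)

def is_transitive(R):              # p6
    return all((x, z) in R for (x, y) in R for (w, z) in R if y == w)

def is_negatively_transitive(R, Y):
    # p7: (not xRy, not yRz) => not xRz
    return all((x, z) not in R
               for x, y, z in product(Y, repeat=3)
               if (x, y) not in R and (y, z) not in R)

def satisfies_2_1(R, Y):
    # (2.1): xRy => (xRz or zRy)
    return all((x, z) in R or (z, y) in R for (x, y) in R for z in Y)

def is_weakly_connected(R, Y):     # p9
    return all((x, y) in R or (y, x) in R for x in Y for y in Y if x != y)

# Hypothetical people and heights in centimetres (assumed data).
heights = {"ann": 160, "bob": 175, "cat": 160, "dan": 182}
Y = set(heights)
shorter = {(x, y) for x in Y for y in Y if heights[x] < heights[y]}

print(is_irreflexive(shorter, Y), is_asymmetric(shorter), is_transitive(shorter))
print(is_negatively_transitive(shorter, Y), satisfies_2_1(shorter, Y))
print(is_weakly_connected(shorter, Y))  # ann and cat have equal height
```

Because two of the assumed people share a height, the relation is a weak order but not weakly connected, in line with the remark above about equal heights.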

2.2  PREFERENCE  AS  A  WEAK  ORDER 

Binary  relations  that  have  or  are  assumed  to  have  certain  properties  are 
often  given  special  names.  In  this  section  we  shall  be  most  concerned  with 
three  types  of  binary  relations,  namely  weak  orders,  strict  orders,  and 
equivalences. 

Definition  2.1.  A  binary  relation  R  on  a  set  Y  is 

a. a weak order <=> R on Y is asymmetric and negatively transitive;

b. a strict order <=> R on Y is a weakly connected weak order;

c. an equivalence <=> R on Y is reflexive, symmetric, and transitive.

The relation < on the real numbers is a weak order and also a strict order
since x < y or y < x whenever x ≠ y; = on the real numbers is an
equivalence, since x = x, x = y => y = x, and (x = y, y = z) => x = z.

An  equivalence  on  a  set  defines  a  natural  partition  of  the  set  into  a  class  of 
disjoint,  nonempty  subsets,  such  that  two  elements  of  the  original  set  are  in 
the  same  class  if  and  only  if  they  are  equivalent.  These  classes  are  called 
equivalence  classes.  Let 

R(x) = {y | y ∈ Y and yRx}.

If R is an equivalence then R(x) is the equivalence class generated by x. In this
case you can readily show that R(x) = R(y) if and only if xRy. Thus, any
two equivalence classes are either identical or disjoint (have no element in
common). When R on Y is an equivalence, we shall denote the set of
equivalence classes of Y under R as Y/R.

Preference  as  a  Weak  Order 

Taking preference ≺ as basic (read x ≺ y as x is less preferred than y, or
y is preferred to x) we shall define indifference ~ as the absence of strict
preference:

x ~ y <=> (not x ≺ y, not y ≺ x). (2.2)

Indifference  might  arise  in  several  ways.  First,  an  individual  might  truly  feel 
that,  in  a  preference  sense,  there  is  no  real  difference  between  x  and  y.  He 
would  just  as  soon  have  x  as  y  and  vice  versa.  Secondly,  indifference  could 
arise  when  the  individual  is  uncertain  as  to  his  preference  between  x  and  y. 
He  might  find  the  comparison  difficult  and  may  decline  to  commit  himself  to 
a  strict  preference  judgment  while  not  being  sure  that  he  regards  x  and  y  as 
equally desirable (or undesirable). Thirdly, x ~ y might arise in a case where
the  individual  considers  x  and  y  incomparable  (in  some  sense)  on  a  preference 
basis. 

Asymmetry  is  an  “obvious”  condition  for  preference.  It  can  be  viewed  as 
a  criterion  of  consistency.  If  you  prefer  x  to  y,  you  should  not  simultaneously 
prefer  y  to  x. 

Transitivity  is  implied  by  asymmetry  and  negative  transitivity,  and  it 
seems  like  a  reasonable  criterion  of  coherence  for  an  individual's  preferences. 
If you prefer x to y and prefer y to z, common sense suggests that you should
prefer  x  to  z. 

However,  the  full  force  of  weak  order  is  open  to  criticism  since  it  imparts 
a  rather  uncanny  power  of  preferential  judgment  to  the  individual,  as  can 
be  seen  from  (2.1).  To  see  how  (2.1)  might  fail,  suppose  that  in  a  funding 
situation  you  feel  that  $1000  is  about  the  best  allocation.  Your  preference 
decreases  as  you  move  away  from  $1000  in  either  direction.  Although  you 
prefer  $955  to  $950,  it  may  also  be  true  that  you  have  no  sure  preference 
between $950 and $1080 or between $955 and $1080. Then ($950 ≺ $955,
$950 ~ $1080, $955 ~ $1080) in violation of (2.1).
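
The funding example can be replayed on the three amounts alone. This is only an illustrative sketch: the set `prefers` encodes the single strict preference stated above, with every other pair treated as indifferent by (2.2), and the variable names are assumptions of the sketch.

```python
X = [950, 955, 1080]
prefers = {(950, 955)}           # $950 is less preferred than $955

def indifferent(x, y):
    # (2.2): absence of strict preference in both directions
    return (x, y) not in prefers and (y, x) not in prefers

# Triples (x, y, z) with x preferred-below y but neither x below z nor z below y,
# that is, the violations of (2.1).
violations = [(x, y, z) for (x, y) in prefers for z in X
              if (x, z) not in prefers and (z, y) not in prefers]
print(violations)

# Indifference fails to be transitive on the same data.
print(indifferent(950, 1080), indifferent(1080, 955), indifferent(950, 955))
```

The only violating triple is (950, 955, 1080), which is exactly the case described in the text.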

In this example, indifference is not transitive. Armstrong (1950, p. 122)
speaks  of  intransitive  indifference  as  arising  from  “the  imperfect  powers  of 
discrimination  of  the  human  mind  whereby  inequalities  become  recognizable 
only  when  of  sufficient  magnitude.”  Later  sections  of  this  chapter  take 
account  of  such  limited  discriminatory  powers  by  not  requiring  ~  to  be 
transitive. 

Our first theorem notes several consequences of weak order, including the
transitivity of indifference. For this theorem and for later work we shall
define preference-indifference ≼ as the union of ≺ and ~:

x ≼ y <=> x ≺ y or x ~ y. (2.3)

THEOREM 2.1. Suppose ≺ on X is a weak order, being asymmetric and
negatively transitive. Then

a. exactly one of x ≺ y, y ≺ x, x ~ y holds for each x, y ∈ X;

b. ≺ is transitive;

c. ~ is an equivalence (reflexive, symmetric, transitive);

d. (x ≺ y, y ~ z) => x ≺ z, and (x ~ y, y ≺ z) => x ≺ z;

e. ≼ is transitive and connected;

f. with ≺' on X/~ (the set of equivalence classes of X under ~) defined by

a ≺' b <=> x ≺ y for some x ∈ a and y ∈ b, (2.4)

≺' on X/~ is a strict order.

Proof. Part (a) follows from asymmetry and (2.2). For (b), suppose x ≺ y
and y ≺ z. Then, by (2.1), (x ≺ z or z ≺ y) and (y ≺ x or x ≺ z). Since
z ≺ y and y ≺ x are false by asymmetry, x ≺ z. Thus ≺ is transitive.
Suppose x ~ y, y ~ z, and not x ~ z, in violation of the transitivity of ~.
Then, by (a), either x ≺ z or z ≺ x, so that by (2.1) one of x ≺ y, y ≺ z,
z ≺ y, and y ≺ x must hold, which contradicts x ~ y, y ~ z, and (a).
Hence ~ is transitive. Suppose as in (d) that x ≺ y and y ~ z. Then, by (a)
and (2.1), x ≺ z. The second half of (d) is similarly proved. For (e) the
transitivity of ≼ follows immediately from (b), (c), and (d). For the
completeness of ≼ suppose to the contrary that (not x ≼ y, not y ≼ x). Then, by
(2.3), (not x ≺ y, not x ~ y, not y ≺ x), which violates (a).

Finally, we examine the properties of a strict order for ≺' on X/~:

1. asymmetry. If a ≺' b and b ≺' a then x ≺ y and y' ≺ x' for some
x, x' ∈ a and y, y' ∈ b, with x ~ x' and y ~ y'. By (d), x' ≺ y. Again, by
(d), x' ≺ y', which contradicts y' ≺ x'.

2. negative transitivity. Suppose a ≺' b with x ∈ a, y ∈ b, and x ≺ y. For
any c ∈ X/~ and any z ∈ c, (2.1) implies that x ≺ z (in which case a ≺' c) or
that z ≺ y (in which case c ≺' b).

3. weak connectedness. Suppose a, b ∈ X/~ and a ≠ b. Then a and b are
disjoint so that if x ∈ a and y ∈ b then not x ~ y. Hence, by (a) either x ≺ y
or y ≺ x, so that either a ≺' b or b ≺' a. ♦
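
Parts (c) and (f) of Theorem 2.1 can be illustrated on a small finite example. The sketch below is an assumption-laden illustration rather than anything from the text: the scores and labels are invented, and the grouping routine compares each element with one representative per class, which is legitimate only because ~ is transitive for a weak order.

```python
def indifference_classes(X, P):
    """Partition X into classes of ~, assuming P is a weak order on X."""
    classes = []
    for x in X:
        for c in classes:
            rep = c[0]
            if (x, rep) not in P and (rep, x) not in P:   # x ~ rep by (2.2)
                c.append(x)
                break
        else:
            classes.append([x])
    return classes

def induced_order(classes, P):
    # (2.4): class i is below class j if some member of i is below some member of j
    return {(i, j)
            for i, a in enumerate(classes)
            for j, b in enumerate(classes)
            if any((x, y) in P for x in a for y in b)}

# Hypothetical weak order generated from assumed numerical scores.
score = {"a": 1, "b": 2, "c": 1, "d": 3}
X = ["a", "b", "c", "d"]
P = {(x, y) for x in X for y in X if score[x] < score[y]}

classes = indifference_classes(X, P)
order = induced_order(classes, P)
print(classes)
print(sorted(order))
```

On this data the induced relation compares every pair of distinct classes in exactly one direction, as a strict order should.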




An  Order-Preserving  Utility  Function 

THEOREM 2.2. If ≺ on X is a weak order and X/~ is countable then there
is a real-valued function u on X such that

x ≺ y <=> u(x) < u(y), for all x, y ∈ X. (2.5)

The utility function u in (2.5) is said to be order-preserving since the
numbers u(x), u(y), ... as ordered by < faithfully reflect the order of
x, y, ... under ≺. Clearly, if (2.5) holds, then

x ≺ y <=> v(x) < v(y), for all x, y ∈ X,

for a real-valued function v on X if and only if [v(x) < v(y) <=> u(x) < u(y)]
holds throughout X. In the next section we shall consider the case where <=>
in (2.5) must be replaced by =>. In later chapters we shall meet utility
functions with properties beyond that of order preservation.

Under the conditions of Theorem 2.2, (2.5) implies that, for all x, y ∈ X,
x ~ y <=> u(x) = u(y), and x ≼ y <=> u(x) ≤ u(y), where ~ and ≼ are
defined by (2.2) and (2.3) respectively.

The  following  proof  of  the  theorem  is  similar  to  proofs  given  by  Birkhoff 
(1948,  p.  31)  and  Suppes  and  Zinnes  (1963,  pp.  26-28).  As  we  shall  see  in 
Chapter 3, the conclusion of the theorem can be false when X/~ is
uncountable (neither finite nor denumerable).

Proof of Theorem 2.2. Assuming the hypotheses of the theorem we shall
assume also that X/~ is denumerable. The proof for finite X/~ is similar and is
left to the reader. Let the elements in X/~ be enumerated as a_1, a_2, a_3, ... and
let the rational numbers be enumerated as r_1, r_2, r_3, .... No particular ≺'
ordering (see (2.4)) or < ordering is implied by these enumerations. We
define a real-valued function u on X/~ as follows, recalling that ≺' as in
(2.4) is a strict order on X/~.

Set u(a_1) = 0. For a_m it follows from the properties for ≺' and induction
that exactly one of the following holds:

1. a_i ≺' a_m for all i < m: if so, set u(a_m) = m,

2. a_m ≺' a_i for all i < m: if so, set u(a_m) = −m,

3. a_i ≺' a_m ≺' a_j for some i, j < m and not (a_i ≺' a_h ≺' a_j)
for every positive integer h that is less than m and differs from i and j: if so,
set u(a_m) equal to the first r_k in the enumeration r_1, r_2, r_3, ... for which
u(a_i) < r_k < u(a_j). Such an r_k exists since there is a rational number between
any two different numbers.

By construction, u(a_m) ≠ u(a_i) for all i < m, and a_i ≺' a_j <=> u(a_i) < u(a_j)
for all i, j ≤ m. This holds for every positive integer m. Hence it holds on all
of X/~. Finally, define u on X by

u(x) = u(a) whenever x ∈ a.

Equation (2.5) then follows provided that, when a ≺' b, x ≺ y for every
x ∈ a and y ∈ b, which follows directly from (2.4) and Theorem 2.1(d). ♦
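
The inductive construction in this proof can be sketched for a finite enumeration of classes. The code below follows cases 1 to 3 with one stated departure: in case 3 it takes the midpoint of the two neighbouring utilities instead of the first rational in a fixed enumeration, which is simpler to compute and still lies strictly between them. The function name, the `below` predicate, and the sample ranking are assumptions of the sketch.

```python
from fractions import Fraction

def order_preserving_utility(enumeration, below):
    """Assign rationals to a_1, a_2, ... so that below(a, b) <=> u[a] < u[b].

    `below` is assumed to be a strict order on the enumerated classes.
    """
    u = {}
    for m, a in enumerate(enumeration, start=1):
        lower = [u[b] for b in u if below(b, a)]   # already placed, below a
        upper = [u[b] for b in u if below(a, b)]   # already placed, above a
        if not lower and not upper:                # m = 1
            u[a] = Fraction(0)
        elif not upper:                            # case 1: a is above everything so far
            u[a] = Fraction(m)
        elif not lower:                            # case 2: a is below everything so far
            u[a] = Fraction(-m)
        else:                                      # case 3: midpoint of nearest neighbours
            u[a] = (max(lower) + min(upper)) / 2
    return u

# Hypothetical strict order given by an assumed ranking; the enumeration
# deliberately ignores that ranking, as in the proof.
rank = {"a": 1, "b": 2, "c": 3, "d": 4}
u = order_preserving_utility(["c", "a", "d", "b"], lambda p, q: rank[p] < rank[q])
print(u)
```

Exact rationals are used so that repeated halving cannot collapse two utilities into the same floating-point value.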

As you will easily note, if (2.5) holds, then ≺ on X must be a weak order.
Hence if ≺ on X is not a weak order then (2.5) is impossible regardless of the
size of X.

2.3 PREFERENCE AS A STRICT PARTIAL ORDER

Throughout the rest of this chapter we shall look at cases where indifference
is not assumed to be transitive. This section considers the case where ≺ is a
strict partial order.

Definition  2.2.  A  binary  relation  R  on  a  set  Y  is  a  strict  partial  order  if 
and  only  if  it  is  irreflexive  and  transitive. 

Since this allows (x ~ y, y ~ z, x ≺ z) when ≺ on X is a strict partial
order, ~ is not necessarily transitive and therefore may not be an equivalence.
However, a new relation ≈ defined as

x ≈ y <=> (x ~ z <=> y ~ z, for all z ∈ X) (2.6)

does turn out to be transitive when ≺ is a strict partial order. x ≈ y holds if,
whenever x is indifferent to a z ∈ X, y also is indifferent to z, and vice versa.
For comparison with Theorem 2.1 we have the following.

THEOREM 2.3. Suppose ≺ on X is a strict partial order, being irreflexive
and transitive. Then

a. exactly one of x ≺ y, y ≺ x, x ≈ y, (x ~ y, not x ≈ y) holds for each
x, y ∈ X;

b. ≈ is an equivalence;

c. x ≈ y <=> (x ≺ z <=> y ≺ z and z ≺ x <=> z ≺ y, for all z ∈ X);

d. (x ≺ y, y ≈ z) => x ≺ z, and (x ≈ y, y ≺ z) => x ≺ z;

e. with ≺* on X/≈ (the set of equivalence classes of X under ≈) defined by

a ≺* b <=> x ≺ y for some x ∈ a and y ∈ b, (2.7)

≺* on X/≈ is a strict partial order.

Proof. (a) follows from asymmetry (implied by irreflexivity and
transitivity) and the fact that x ≈ y can hold only if x ~ y. For (b), the reflexivity
and symmetry of ≈ follow directly from (2.6) and the reflexivity and
symmetry of ~. Suppose x ≈ y and y ≈ z. Then, by (2.6), if x ~ t then y ~ t
and, again by (2.6), if y ~ t then z ~ t. Hence x ~ t => z ~ t. Conversely
z ~ t => x ~ t, so that x ≈ z as desired for transitivity.

For part (c) suppose first that x ≈ y. If x ≺ z then either y ≺ z or y ~ z,
for if z ≺ y then x ≺ y by transitivity of ≺. But if y ~ z then x ~ z by (2.6),
which contradicts x ≺ z. Hence x ≺ z => y ≺ z. Similarly y ≺ z => x ≺ z.
A similar proof shows that z ≺ x <=> z ≺ y. (This also establishes (d).) On
the other hand, assume that the right part of (c) holds. Then, if x ~ t, it
cannot be true that either y ≺ t or t ≺ y so that y ~ t by (2.2) and the
asymmetry of ≺. Conversely y ~ t => x ~ t. Hence x ≈ y.

For (e), we cannot have a ≺* a when a ∈ X/≈ for then x ≺ y for some
x and y for which x ≈ y, which is false by (a). For transitivity suppose
(a ≺* b, b ≺* c). Then (x ≺ y, y ≈ y', y' ≺ z) for some x ∈ a, y, y' ∈ b, and
z ∈ c. x ≺ z then follows from (d) so that a ≺* c. ♦
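
Definition (2.6) and the characterization in Theorem 2.3(c) are easy to compare on a finite example. The following sketch uses an invented four-element strict partial order; the function names are hypothetical. It shows two elements that are indifferent without standing in the relation ≈, and checks that (2.6) and part (c) agree on every pair.

```python
def indifferent(x, y, P):
    # (2.2): neither element is strictly preferred to the other
    return (x, y) not in P and (y, x) not in P

def approx(x, y, X, P):
    # (2.6): x and y are indifferent to exactly the same elements
    return all(indifferent(x, z, P) == indifferent(y, z, P) for z in X)

def approx_via_c(x, y, X, P):
    # Theorem 2.3(c): same elements strictly above and strictly below
    return all(((x, z) in P) == ((y, z) in P) and
               ((z, x) in P) == ((z, y) in P) for z in X)

# Hypothetical strict partial order: a is below c, nothing else is comparable.
X = ["a", "b", "c", "d"]
P = {("a", "c")}

print(indifferent("a", "b", P), approx("a", "b", X, P))   # indifferent but not ≈
print(approx("b", "d", X, P))                             # b and d behave alike
print(all(approx(x, y, X, P) == approx_via_c(x, y, X, P) for x in X for y in X))
```

Here a ~ b, b ~ c, and yet a is strictly below c, which is the intransitive indifference that a strict partial order permits.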

Zorn’s  Lemma  and  Szpilrajn’s  Extension  Theorem 

Before we can establish a utility-representation theorem for the case where
≺ is a strict partial order and X/≈ is countable, we need to prove the
following theorem, due to Szpilrajn (1930).

THEOREM 2.4. If ≺* is a strict partial order on a set Y then there is a
strict order ≺⁰ on Y that includes ≺*, so that

x ≺* y => x ≺⁰ y, for all x, y ∈ Y. (2.8)

The  utility  theorem  given  later  as  Theorem  2.5  is  very  easily  proved  from 
Theorem  2.4  and  the  proof  of  Theorem  2.2. 

To  establish  Szpilrajn’s  theorem,  which  holds  regardless  of  the  size  of  Y, 
we  shall  need  an  axiom  of  set  theory  that  goes  by  the  name  of  Zorn’s  Lemma. 

ZORN’S LEMMA. Suppose P on Y is a strict partial order and, for any
subset Z of Y on which P is a strict order, there is a y ∈ Y such that zPy or
z = y for all z ∈ Z. Then there is a y* ∈ Y such that y*Px for no x ∈ Y.

Consider  the  real  numbers  in  their  natural  order  under  <.  Since  <  itself 
on  the  numbers  is  a  strict  order  but  there  is  no  number  y  such  that  x  <  y  or 
x  =  y  for  every  number  x,  the  “lemma”  does  not  imply  that  the  real  numbers 
have  a  maximal  element  under  < ,  as  of  course  they  do  not. 

Zorn’s Lemma, used today by most mathematicians, is an assumption.
Kelley (1955, pp. 31-36) presents other axioms that are equivalent to Zorn’s
Lemma. One of these is the Axiom of Choice: if 𝒮 is a set of nonempty sets
then there is a function f on 𝒮 such that f(S) ∈ S for each S ∈ 𝒮.

Proof of Theorem 2.4. If ≺* is a strict order, there is nothing to prove.
Suppose then that ≺* is a strict partial order and that x, y in Y are such that
x ≠ y, (not x ≺* y, not y ≺* x). Define ≺¹ on Y thus:

a ≺¹ b <=> a ≺* b or else [(a ≺* x or a = x), (y ≺* b or y = b)]. (2.9)

Clearly, a ≺* b => a ≺¹ b, and x ≺¹ y. We prove first that ≺¹ is a strict
partial order.

A. ≺¹ is irreflexive. To the contrary suppose a ≺¹ a. Then, if either
(a ≺* x, y ≺* a) or (a ≺* x, y = a) or (a = x, y ≺* a), we get y ≺* x,
which is false. Also, a ≺* a and (a = x, y = a) are false by assumption.
Hence a ≺¹ a is false.

B. ≺¹ is transitive. Assume (a ≺¹ b, b ≺¹ c). If (a ≺* b, b ≺* c) then
a ≺* c so that a ≺¹ c. If (a ≺* b, (b ≺* x or b = x) and (y ≺* c or y = c))
then a ≺* x so that a ≺¹ c by (2.9). If ((a ≺* x or a = x) and (y ≺* b or
y = b), b ≺* c) then y ≺* c and hence a ≺¹ c. Finally, if neither a ≺* b nor
b ≺* c then, by (2.9), (y ≺* b or y = b) from a ≺¹ b and (b ≺* x or b = x)
from b ≺¹ c, which are incompatible since they give y ≺* x or y = x, which
are false. Hence this final case cannot arise.

We now use Zorn’s Lemma. With A ⊆ B <=> A is a subset of B, we define
A ⊂ B <=> (A ⊆ B, not B ⊆ A). Let ℛ be the set of all strict partial orders
on Y that include ≺*, so that R ∈ ℛ <=> (R on Y is a strict partial order and
≺* ⊆ R). In Zorn’s Lemma as stated above, ⊂ takes the part of P and ℛ
takes the part of Y.

Clearly, ⊂ on ℛ is a strict partial order. Let 𝒮 be a subset of ℛ on which ⊂
is a strict order. (We omit the trivial case where 𝒮 = ∅.) Let S be the set of
all (x, y) that are in at least one R ∈ 𝒮: that is, (x, y) ∈ S or xSy if and only if
(x, y) ∈ R or xRy for some R ∈ 𝒮. Clearly, R ⊆ S for every R ∈ 𝒮. To apply
Zorn’s Lemma we need to show that S ∈ ℛ, or that S on Y is a strict partial
order:

A. S is irreflexive. (x, x) ∉ S since (x, x) ∉ R for every R ∈ ℛ.

B. S is transitive. If (x, y) ∈ S and (y, z) ∈ S then (x, y) ∈ S_1 and (y, z) ∈ S_2
for some S_1 and S_2 in 𝒮. For definiteness suppose S_1 ⊆ S_2. Then (x, y) ∈ S_2
and hence (x, z) ∈ S_2 by transitivity, so that (x, z) ∈ S by the definition of S.

It follows from Zorn’s Lemma that there is a ≺⁰ ∈ ℛ such that ≺⁰ ⊂ R
for no R ∈ ℛ. Because ≺⁰ is in ℛ, it is a strict partial order. To show that it
is a strict order, it remains to note that ≺⁰ on Y is weakly connected, for
when this is true ≺⁰ must be a strict order. (You can easily show that a
weakly connected strict partial order satisfies (2.1), or negative transitivity,
and is thus a strict order by Definition 2.1.) Suppose then that contrary to
weak connectedness there are x, y ∈ Y with x ≠ y and (not x ≺⁰ y, not
y ≺⁰ x). Then, by the first part of this proof, there is a strict partial order ≺¹
on Y such that a ≺⁰ b => a ≺¹ b, and x ≺¹ y. But then ≺⁰ ⊂ ≺¹, which
contradicts ≺⁰ ⊂ R for no R ∈ ℛ. Hence ≺⁰ is weakly connected. ♦
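
On a finite set no appeal to Zorn’s Lemma is needed, since the one-step extension (2.9) can simply be repeated until no incomparable pair remains; each step adds at least one ordered pair, so the loop must stop. The sketch below illustrates this under the assumption that the input relation is already irreflexive and transitive. The function names and the four-element "diamond" example are hypothetical.

```python
from itertools import product

def extend_once(P, x, y):
    """Apply (2.9): force x below y, together with everything that this implies."""
    below_x = {a for (a, b) in P if b == x} | {x}   # a below x, or a = x
    above_y = {b for (a, b) in P if a == y} | {y}   # y below b, or y = b
    return P | set(product(below_x, above_y))

def strict_order_extension(Y, P):
    """Extend a strict partial order P on the finite list Y to a strict order."""
    P = set(P)
    while True:
        gap = next(((x, y) for x in Y for y in Y
                    if x != y and (x, y) not in P and (y, x) not in P), None)
        if gap is None:          # weakly connected: nothing left to decide
            return P
        P = extend_once(P, *gap)

# Hypothetical partial order: a below b and c, both below d; b and c incomparable.
Y = ["a", "b", "c", "d"]
P0 = {("a", "b"), ("a", "c"), ("b", "d"), ("c", "d"), ("a", "d")}
L = strict_order_extension(Y, P0)
print(sorted(L))
```

Which incomparable pair is resolved first, and in which direction, is an arbitrary choice here; different choices give different strict orders, all of which include the original relation.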

Another  Utility  Theorem 

With ≈ defined by (2.6), the following theorem says that when ≺ is
irreflexive and transitive and X/≈ is countable, numbers can be assigned to
the elements of X so as to faithfully preserve both ≺ and ≈. However,
because ~ can be intransitive, we cannot guarantee that u(x) = u(y) when
x ~ y and not x ≈ y. We might have any one of u(x) = u(y), u(x) < u(y),
and u(y) < u(x) when (x ~ y, not x ≈ y).

THEOREM 2.5. If ≺ on X is a strict partial order and X/≈ is countable
then there is a real-valued function u on X such that, for all x, y ∈ X,

x ≺ y => u(x) < u(y) (2.10)

x ≈ y => u(x) = u(y). (2.11)

Proof. By Theorem 2.3(e), ≺* on X/≈ as defined in (2.7) is a strict
partial order. By Theorem 2.4, there is a strict order ≺⁰ on X/≈ that includes
≺*. With X/≈ countable, the proof of Theorem 2.2 guarantees a real-valued
function u on X/≈ such that a ≺⁰ b <=> u(a) < u(b), for all a, b ∈ X/≈. With
a ∈ X/≈, set u(x) = u(a) whenever x ∈ a. Then, if x ≈ y, u(x) = u(y), so
that (2.11) holds. And if x ≺ y with x ∈ a and y ∈ b then a ≺* b by (2.7) and
Theorem 2.3(d): hence a ≺⁰ b so that u(a) < u(b) and u(x) < u(y). ♦
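
For a finite X there is a shortcut that avoids the extension theorem altogether, and it is offered here only as an illustration, not as the method of the proof above. Let u(x) be the number of elements strictly below x. If x ≺ y then, by transitivity and irreflexivity, everything below x is below y and x itself is an extra element below y, so u(x) < u(y); if x ≈ y then Theorem 2.3(c) gives both elements the same set below them, so u(x) = u(y). The function name and the example relation are assumptions of the sketch.

```python
def count_below_utility(X, P):
    """u(x) = number of z with z strictly below x; valid for finite strict partial orders."""
    return {x: sum(1 for z in X if (z, x) in P) for x in X}

# Hypothetical strict partial order (already transitive).
X = ["a", "b", "c", "d"]
P = {("a", "b"), ("b", "c"), ("a", "c"), ("d", "c")}

u = count_below_utility(X, P)
print(u)
print(all(u[x] < u[y] for (x, y) in P))   # (2.10)
```

In this example a and d receive the same utility although they are not related by ≈, which the theorem allows: (2.11) is an implication, not an equivalence.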

2.4  ORDERED  INDIFFERENCE  INTERVALS 

There  are  other  interesting  assumptions  for  preferences  that  add  things  to 
strict  partial  order,  but  still  retain  the  possibility  of  intransitive  indifference. 
Two  such  conditions  were  introduced  into  preference  theory  by  Luce  (1956). 
They  are  stated  here  in  the  form  given  by  Scott  and  Suppes  (1958,  p.  117). 

p10. (x ≺ y, z ≺ w) => (x ≺ w or z ≺ y), for all x, y, z, w ∈ X.
p11. (x ≺ y, y ≺ z) => (x ≺ w or w ≺ z), for all x, y, z, w ∈ X.

It is easily seen that if ≺ is irreflexive and either p10 or p11 holds then ≺ is
transitive. When ≺ is a strict partial order, the only instances of p10 and p11
that are not already implied by irreflexivity and transitivity are those
illustrated in Figure 2.1. For p10, we have the case shown on the left of the figure
where x ≺ y, z ≺ w, x ~ z, and y ~ w, with x ≠ z and y ≠ w. In a sort of
cross-connectedness, p10 says that at least one of the dashed lines must be
strict preference: we can’t have both x ~ w and y ~ z. For p11 we get the
picture on the right of the figure where x ≺ y ≺ z and w ~ y with w ≠ y.
Here p11 says that at least one of the dashed w-lines must represent strict
preference: w can’t be indifferent to each of x, y, and z.

Figure 2.1 Cases not covered by irreflexivity and transitivity. (Panels: p10,
left; p11, right. Axis: increasing preference.)

Conditions p10 and p11 may seem reasonable if the elements of X are
naturally ordered and preference is either nondecreasing or nonincreasing
as one proceeds along the natural order. For example, if you prefer your
coffee black it seems fair to assume that your preference will not increase as
x, the number of grains of sugar in your coffee, increases. You might well be
indifferent between x = 0 and x = 1, between x = 1 and x = 2, ..., but
of course will prefer x = 0 to x = 1000. Although ~ is not transitive here,
p10 and p11 would probably hold along with irreflexivity.

However, if there are several factors that influence preference or if there
is only one basic factor along which preference increases up to a point and
decreases thereafter, ≺ may fail to satisfy the cases of p10 and p11 shown in
Figure 2.1. To continue with coffee and sugar, suppose you like about 1000
grains of sugar in your coffee. The left part of Figure 2.2 shows a case where
it might be true that x ≺ y, z ≺ w, x ~ w, and y ~ z, in violation of p10.
The right part of the figure suggests that p11 may fail with x ≺ y ≺ z and
w ~ x, w ~ y, w ~ z. We could expect both p10 and p11 to hold on a fixed
side of your peak or ideal (Coombs, 1964) but there seems to be little reason
to suppose that they hold for the cases illustrated. The funding situation in
Section 2.2 gives another peaked situation where p10 and p11 might not hold.

Figure 2.2 “Failures” of p10 and p11 for single-peaked preferences.

Definition 2.3. A binary relation is an interval order if it is irreflexive and
satisfies p10, and a semiorder if it is irreflexive and satisfies p10 and p11.

The  term  “semiorder”  was  introduced  by  Luce  (1956)  and  is  now  standard 
terminology.  The  way  I  use  “interval  order”  is  not  standard,  but  seems 
reasonable  in  view  of  Theorem  2.7. 
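
Definition 2.3 can be tested directly on finite relations. In the sketch below, which is illustrative only, the black-coffee example is modelled with a hypothetical discrimination threshold of three grains (an assumption, not a figure from the text), and two small relations reproduce the patterns of Figure 2.1 that p10 and p11 rule out. All function and variable names are invented for the illustration.

```python
def is_irreflexive(X, P):
    return all((x, x) not in P for x in X)

def satisfies_p10(P):
    # (x below y, z below w) => (x below w or z below y)
    return all((x, w) in P or (z, y) in P for (x, y) in P for (z, w) in P)

def satisfies_p11(X, P):
    # (x below y, y below z) => (x below w or w below z), for every w
    return all((x, w) in P or (w, z) in P
               for (x, y) in P for (y2, z) in P if y == y2
               for w in X)

def is_interval_order(X, P):
    return is_irreflexive(X, P) and satisfies_p10(P)

def is_semiorder(X, P):
    return is_interval_order(X, P) and satisfies_p11(X, P)

# Black coffee: x grains is less preferred than y grains once x exceeds y
# by at least the assumed threshold of 3 grains.
grains = list(range(10))
black = {(x, y) for x in grains for y in grains if x - y >= 3}

# The two patterns of Figure 2.1 with all dashed lines read as indifference.
letters = ["x", "y", "z", "w"]
crossed = {("x", "y"), ("z", "w")}
chain = {("x", "y"), ("y", "z"), ("x", "z")}

print(is_semiorder(grains, black))
print(is_interval_order(letters, crossed))
print(is_interval_order(letters, chain), is_semiorder(letters, chain))
```

The relation `chain` is an interval order that is not a semiorder, because w is indifferent to each of x, y, and z.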

Interval  Orders 

In the rest of the chapter, ≈ is defined by (2.6). For interval orders
(p2, p10) we shall use the following:

x ≺¹ y <=> (x ~ z, z ≺ y) for some z ∈ X (2.12)

x ≺² y <=> (x ≺ z, z ~ y) for some z ∈ X. (2.13)

THEOREM 2.6. If ≺ on X is an interval order then each of ≺¹ and ≺² is a
weak order, and x ≈ y <=> (x ~¹ y, x ~² y), where x ~ʲ y <=> (not x ≺ʲ y, not
y ≺ʲ x) for j = 1, 2.

Proof. The final assertion follows from (2.6). To prove asymmetry for
≺¹ suppose to the contrary that (x ≺¹ y, y ≺¹ x). Then (x ~ z, z ≺ y) and
(y ~ w, w ≺ x) for some z, w ∈ X, which contradict p10. To establish
negative transitivity suppose to the contrary that (not x ≺¹ y, not y ≺¹ z, x ≺¹ z).
By x ≺¹ z, (x ~ t, t ≺ z) for some t ∈ X. From x ~ t and not x ≺¹ y, (2.12)
implies not t ≺ y. From t ≺ z and not y ≺¹ z, (2.12) yields not t ~ y. Hence
y ≺ t. But then, by transitivity, y ≺ z which implies y ≺¹ z, contradicting
not y ≺¹ z. Hence ≺¹ is negatively transitive. The proof for ≺² is similar and
is left to the reader. ♦

THEOREM 2.7. If ≺ on X is an interval order and X/≈ is countable then
there are real-valued functions u and σ on X with σ(x) > 0 for all x ∈ X such
that

x ≺ y <=> u(x) + σ(x) < u(y), for all x, y ∈ X. (2.14)

Note also that if (2.14) holds then p10 must hold.

Theorem 2.7 is like the weak order Theorem 2.2 with the addition of a
“vagueness” function σ which allows for intransitive indifference. The
indifference interval for x is I(x) = [u(x), u(x) + σ(x)]. By (2.14), I(x) is
wholly to the left of I(y) if and only if x ≺ y. If two intervals intersect then
their elements are indifferent. As seen by the failure (x ≺ y ≺ z, w ~ x,
w ~ y, w ~ z) of p11, one indifference interval may lie entirely within another
interval: in the case at hand, I(y) must be shorter than I(w).
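
The interval picture can be made concrete by going in the easy direction, from assumed functions u and σ to the preference relation they generate through (2.14). The numbers below are invented for illustration. They reproduce the situation just described: the interval for y sits inside the wide interval for w, p10 holds, and p11 fails.

```python
def preference_from_intervals(u, sigma):
    # (2.14): x is below y exactly when I(x) lies wholly to the left of u(y)
    return {(x, y) for x in u for y in u if u[x] + sigma[x] < u[y]}

def satisfies_p10(P):
    return all((x, w) in P or (z, y) in P for (x, y) in P for (z, w) in P)

def satisfies_p11(X, P):
    return all((x, w) in P or (w, z) in P
               for (x, y) in P for (y2, z) in P if y == y2
               for w in X)

# Assumed utilities and vagueness: I(x)=[0,1], I(y)=[2,3], I(z)=[4,5], I(w)=[0,5].
u = {"x": 0, "y": 2, "z": 4, "w": 0}
sigma = {"x": 1, "y": 1, "z": 1, "w": 5}

P = preference_from_intervals(u, sigma)
print(sorted(P))
print(satisfies_p10(P), satisfies_p11(list(u), P))
```

Because σ varies from element to element here, the resulting relation is an interval order but not a semiorder.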

Proof of Theorem 2.7. Let ≺ on X be an interval order. Using the Axiom
of Choice let Y consist of one element from each equivalence class in X/≈.
For each x ∈ Y let x* denote an artificial element that corresponds to x, with
Y* the set of artificial elements. Define ≺³ on Y ∪ Y* (the set of elements in
Y or Y*) as follows:

x ≺³ y <=> x ≺¹ y (2.15)

x* ≺³ y* <=> x ≺² y (2.16)

x* ≺³ y <=> x ≺ y (2.17)

x ≺³ y* <=> x ≼ y (2.18)

where ≼ = ≺ ∪ ~ as in (2.3). We prove that ≺³ on Y ∪ Y* is a weak
order.

Asymmetry. We want a ≺³ b => not b ≺³ a. If (a, b) = (x, y) or (a, b) =
(x*, y*) then asymmetry follows from Theorem 2.6 and (2.15) or (2.16).
Suppose (a, b) = (x*, y) and (a ≺³ b, b ≺³ a). Then (x ≺ y, y ≼ x) by
(2.17) and (2.18), which is impossible.

Negative Transitivity. We shall suppose that (not $a \prec^3 b$, not $b \prec^3 c$,
$a \prec^3 c$) and obtain a contradiction. The cases for $(a, b, c) = (x, y, z)$ and
$(a, b, c) = (x^*, y^*, z^*)$ are covered by Theorem 2.6. The others follow.

1. $(x, y, z^*)$. Then not $x \prec^1 y$, $z \prec y$, $x \precsim z$. If $x \sim z$ then $x \prec^1 y$, which
contradicts not $x \prec^1 y$. If $x \prec z$ then $x \prec y$, which implies $x \prec^1 y$, a
contradiction.

2. $(x, y^*, z)$. Then $y \prec x$, $z \precsim y$, $x \prec^1 z$. From the last of these, ($x \sim t$,
$t \prec z$), which along with ($y \prec x$, $z \precsim y$) contradicts p10.

3. $(x^*, y, z)$. Then $y \precsim x$, not $y \prec^1 z$, $x \prec z$. Similar to Case 1.

4. $(x, y^*, z^*)$. Then $y \prec x$, not $y \prec^2 z$, $x \precsim z$. If $x \sim z$ then $y \prec^2 z$, and if
$x \prec z$ then $y \prec z$ and hence $y \prec^2 z$, contradicting not $y \prec^2 z$.

5. $(x^*, y, z^*)$. Then $y \precsim x$, $z \prec y$, $x \prec^2 z$. From the last of these, ($x \prec t$,
$t \sim z$), which along with ($y \precsim x$, $z \prec y$) contradicts p10.

6. $(x^*, y^*, z)$. Then not $x \prec^2 y$, $z \precsim y$, $x \prec z$. Similar to Case 4.

Assume that $X/\approx$ is countable. Then $Y \cup Y^*$ is countable and by
Theorem 2.2 there is a real-valued function $f$ on $Y \cup Y^*$ such that, for all
$b, c \in Y \cup Y^*$,

$b \prec^3 c \iff f(b) < f(c)$.

For $x \in Y$ let $u(x) = f(x)$ and $\sigma(x) = f(x^*) - f(x)$. Then, using (2.17),
$x \prec y \iff u(x) + \sigma(x) < u(y)$, for all $x, y \in Y$. Since $x \prec^3 x^*$ by (2.18), $\sigma > 0$.
Let $u(x) = u(y)$ and $\sigma(x) = \sigma(y)$ whenever $x \approx y$ and $y \in Y$. Then (2.14)
follows from Theorem 2.6 and Theorem 2.3(d). ♦

Semiorders 

On adding p11 to p2 and p10 we obtain the following extension of Theorem
2.6.

THEOREM 2.8. Suppose $\prec$ on $X$ is a semiorder and, with $\prec^1$ and $\prec^2$
defined by (2.12) and (2.13), $\prec^0$ on $X$ is defined by

$x \prec^0 y \iff x \prec^1 y$ or $x \prec^2 y$, for all $x, y \in X$. (2.19)

Then $\prec^0$ on $X$ is a weak order.

Proof. Asymmetry. ($x \prec^1 y$, $y \prec^1 x$) and ($x \prec^2 y$, $y \prec^2 x$) are prohibited
by Theorem 2.6. Suppose ($x \prec^1 y$, $y \prec^2 x$). Then ($x \sim z$, $z \prec y$) and ($y \prec w$,
$w \sim x$) for some $z, w \in X$, which violates p11.

Negative Transitivity. By (2.19), not $x \prec^0 y \Rightarrow$ (not $x \prec^1 y$, not $x \prec^2 y$)
and not $y \prec^0 z \Rightarrow$ (not $y \prec^1 z$, not $y \prec^2 z$). Therefore, by the negative
transitivity of $\prec^1$ and $\prec^2$, (not $x \prec^1 z$, not $x \prec^2 z$), so that not $x \prec^0 z$ by
(2.19). ♦

When $X/\approx$ is finite and $\prec$ is a semiorder, it is possible to make $\sigma$ in (2.14)
constant  on  X.  A  constructive  proof  of  this  is  given  in  Scott  and  Suppes 
(1958)  or  in  Suppes  and  Zinnes  (1963).  An  alternative  proof,  similar  to  that 
given  by  Scott  (1964),  uses  the  Theorem  of  The  Alternative  which  will  be 
introduced  in  Chapter  4.  Exercise  4.18  gives  an  outline  of  the  alternative 
proof  of  the  following  theorem. 

THEOREM 2.9. Suppose $\prec$ on $X$ is a semiorder and $X/\approx$ is finite. Then
there is a real-valued function $u$ on $X$ such that

$x \prec y \iff u(x) + 1 < u(y)$, for all $x, y \in X$. (2.20)

With an appropriate change in $u$, any positive number could be used in
(2.20) in place of 1.
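
One change of $u$ that works is multiplication by the desired threshold: if $u$ satisfies (2.20) and $\alpha > 0$, then $v = \alpha u$ gives $x \prec y \iff v(x) + \alpha < v(y)$. The sketch below checks this on made-up utilities; the element names, the values, and the helper `rescale` are illustrative assumptions, and exact fractions are used so that the comparison is not affected by rounding.

```python
from fractions import Fraction
from itertools import product

def rescale(u, alpha):
    """Return v = alpha * u, which turns the unit threshold into alpha."""
    return {x: alpha * value for x, value in u.items()}

# Hypothetical utilities satisfying (2.20) for the semiorder they induce.
u = {"a": Fraction(0), "b": Fraction(1, 2), "c": Fraction(2), "d": Fraction(7, 2)}
alpha = Fraction(1, 3)
v = rescale(u, alpha)

# The relation induced by (u, threshold 1) equals the one induced by (v, alpha).
same_relation = all(
    (u[x] + 1 < u[y]) == (v[x] + alpha < v[y])
    for x, y in product(u, repeat=2)
)
print(same_relation)
```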

2.5  SUMMARY 

A binary relation on a set is a weak order if it is asymmetric and negatively
transitive. Defining indifference as the absence of strict preference, $\sim$ on
$X$ is an equivalence (reflexive, symmetric, transitive) when $\prec$ on $X$ is a weak
order. If the set $X/\sim$ of equivalence classes of $X$ under $\sim$ is countable when
$\prec$ is a weak order then utilities $u(x), u(y), \ldots$ can be assigned to the elements
in $X$ so that $x \prec y \iff u(x) < u(y)$. This gives $x \sim y \iff u(x) = u(y)$ also.
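
For a finite set, one simple assignment of this kind counts how many elements lie strictly below each element. The sketch below is only an illustration of that idea, not the construction used in the proof of Theorem 2.2; the function name, the item list, and the "shorter word is less preferred" relation are all hypothetical.

```python
from itertools import product

def utility_from_weak_order(items, prefers):
    """u(x) = number of items y with y ≺ x (prefers(y, x) means y ≺ x)."""
    return {x: sum(1 for y in items if prefers(y, x)) for x in items}

# Hypothetical weak order: a word is less preferred than any longer word.
items = ["fig", "kiwi", "plum", "banana"]

def shorter(a, b):
    return len(a) < len(b)

u = utility_from_weak_order(items, shorter)
print(u)

# x ≺ y holds exactly when u(x) < u(y), and ties are exactly the indifferent pairs.
order_preserved = all(
    shorter(x, y) == (u[x] < u[y]) for x, y in product(items, repeat=2)
)
print(order_preserved)
```

The counting rule relies on negative transitivity: indifferent elements have the same set of strictly worse elements, so they receive equal utility.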

The preference relation is a strict partial order when it is irreflexive and
transitive. In this case indifference may be intransitive but $\approx$, defined by
$x \approx y \iff (x \sim z \iff y \sim z$, for all $z \in X)$, is an equivalence. When $\prec$ on $X$
is a strict partial order and $X/\approx$ is countable, utilities can be assigned so
that $u(x) < u(y)$ if $x \prec y$, and $u(x) = u(y)$ if $x \approx y$.

Interval orders and semiorders lie between strict partial orders and weak
orders. When $\prec$ on $X$ is an interval order or a semiorder and $X/\approx$ is
countable, we get $x \prec y \iff I(x)$ is wholly to the left of $I(y)$, where $I$ is a function
that assigns an interval of real numbers to each $x \in X$. If $\prec$ is a semiorder
and $X/\approx$ is finite then all indifference intervals can be made to have the same
length.


INDEX  TO  EXERCISES 

1. Denumerable sets. 2. Binary relations. 3. Weak order. 4. Quasi order. 5-7.
Asymmetric transitive closure. 8. Equivalence. 9. Partitions. 10-13. Interval orders and
semiorders. 14. Choice sets. 15-16. Cartesian products. 17-18. Lexicographic orders. 19.
Theorem 2.2. 20-21. Sets and relations.


Exercises 

1. Prove that the following sets are denumerable: (a) $\{2, 4, 6, \ldots\}$, the set of all
positive, even integers; (b) $\{\ldots, -2, -1, 0, 1, 2, \ldots\}$; (c) the set of all positive
rational numbers (Hint: place these in a two-dimensional array with 1/1, 1/2,
1/3, ... in the first row, 2/1, 2/2, 2/3, ... in the second row, and so forth); (d) the
set of all rational numbers.

2. With Y the set of all living people, identify the meaning of cases (1) through
(4) in Section 2.1 and state which of properties p1 through p9 hold for the binary
relation identified as:

a.  “is  a  blood-line  descendant  of,” 

b.  “is  married  to”  (assuming  monogamy  throughout  society), 

c.  “is  married  to”  (admitting  polygamy), 

d.  “is  as  old  as,” 

e.  “has  fathered  or  mothered  the  same  number  of  children  as.” 

3. Suppose $\precsim$ on $X$ is transitive and connected, and $\prec$ and $\sim$ are defined as
follows: $x \prec y \iff$ not $y \precsim x$; $x \sim y \iff (x \precsim y, y \precsim x)$. Prove that $\prec$ is a weak
order and that $\sim$ is an equivalence.




4. $\precsim$ on $X$ is a quasi order if it is reflexive and transitive. Prove that if $\precsim$ on $X$
is a quasi order and $\prec$, $\sim$ are defined as in Exercise 3 then

a. $\sim$ on $X$ is an equivalence

b. $\prec$ on $X$ is a strict partial order

c. $(x \sim y, y \prec z) \Rightarrow x \prec z$, $(x \prec y, y \sim z) \Rightarrow x \prec z$.

5. If $\prec$ on $X$ is a binary relation, the transitive closure $\prec^t$ of $\prec$ is defined as
follows:

$x \prec^t y \iff x \prec y$ or there are $x_1, x_2, \ldots, x_m \in X$ such that

$x \prec x_1, x_1 \prec x_2, \ldots, x_{m-1} \prec x_m, x_m \prec y$.

Prove that if $\prec^t$ is asymmetric then $\prec^t$ is a strict partial order.

6. (Continuation.) Suppose $X$ is countable. Use Theorem 2.5 to prove that there is
a real-valued function $u$ on $X$ that satisfies (2.10) if and only if the transitive closure
of $\prec$ is asymmetric.

7. (Continuation.) Give an example of a $\prec$ on $X$ whose transitive closure is
asymmetric and such that, with $u$ satisfying (2.10) and $\approx$ defined by (2.6), it is not
possible for $u$ to satisfy (2.11) also.

8. Using (2.2) and (2.6) prove that $\approx$ is an equivalence when $\prec$ on $X$ is
asymmetric.

9. A partition of a set $Y$ is a set of nonempty subsets of $Y$ such that each $x \in Y$
is in exactly one element of the partition. Prove that any partition of $Y$ is a set of
equivalence classes under some equivalence relation on $Y$.

10. Prove that (p2, p10) $\Rightarrow$ p6 (transitivity) and that (p2, p11) $\Rightarrow$ p6.

11. (From Fred Roberts.) $(X, \sim)$ is an interval graph $\iff$ a real interval $I(x)$ can
be assigned to each $x \in X$ so that, for all $x, y \in X$, $x \sim y$ if and only if $I(x)$ and
$I(y)$ intersect. Prove that if $X$ is countable then $\prec$ on $X$ is an interval order if and
only if $\prec$ is transitive and $(X, \sim)$ is an interval graph.

12. (Continuation.) Roberts (1969). Suppose $X$ is finite. Prove that $\prec$ on $X$ is a
semiorder if and only if $(X, \sim)$ is an interval graph and ($x \sim w$, $y \sim w$, $z \sim w$, not
$x \sim y$, not $y \sim z$, not $x \sim z$) is false whenever $x$, $y$, $z$, and $w$ are in $X$.

13. Show that if (2.20) holds when $\prec$ is a semiorder and $X$ is finite, then for
any $\alpha > 0$ there is a real-valued function $v_\alpha$ on $X$ such that, for all $x, y \in X$,
$x \prec y \iff v_\alpha(x) + \alpha < v_\alpha(y)$.

14. Arrow (1959). Let $F$ (the choice function) be a function that, for every
nonempty subset $Y$ of $X$, assigns a nonempty subset of $Y$ to $Y$, so that $F(Y) \subseteq Y$ and
$F(Y) \neq \emptyset$ for every $Y \subseteq X$ such that $Y \neq \emptyset$. Consider the following conditions
on $F$.

TRANSITIVITY. $y \in F(\{x, y\})$, $z \in F(\{y, z\}) \Rightarrow z \in F(\{x, z\})$.

EXTENSION. $F(Y) = \{x : x \in Y$ and $x \in F(\{x, y\})$ for every $y \in Y\}$, provided
that the set $\{x : \cdots\}$ is not empty.

TE. If $x, y \in Y$, $x, y \in Y^*$, $x \in F(Y)$, and $y \notin F(Y)$ then $y \notin F(Y^*)$.

Interpret each of these conditions in your own words when $F(Y)$ is the individual's
set of most preferred elements in $Y$, that is, his choice set. Then suppose that $X$ is
finite and prove that Transitivity and Extension hold if and only if TE holds. In
doing this it may help to note that, with $x \precsim y \iff y \in F(\{x, y\})$, $\precsim$ is transitive
and connected when Transitivity holds, and that, when the first two conditions
hold then $F(Y) = \{x : x \in Y$ and $y \precsim x$ for all $y \in Y\}$.

15. Show that $\{(x_1, x_2) : x_1$ and $x_2$ are positive integers$\}$ is denumerable.

16. (Continuation.) The Cartesian product of sets $X_1$ and $X_2$ is $X_1 \times X_2 =
\{(x_1, x_2) : x_1 \in X_1$ and $x_2 \in X_2\}$. Use the preceding results to show that $X_1 \times X_2$ is
denumerable if both $X_1$ and $X_2$ are denumerable.

17. (Continuation.) With $X = X_1 \times X_2$ let $X_1 = \{1, 2, \ldots\}$ and let $X_2$ be the
set of all rational numbers between 0 and 1 inclusive. Define $\prec$ on $X$ by $(x_1, x_2) \prec
(y_1, y_2) \iff x_1 < y_1$ or $(x_1 = y_1, x_2 < y_2)$. (This weak order is a lexicographic order
since it orders the pairs of numbers like two-letter words would be ordered in a
dictionary.) Write out an explicit formula for $u$ on $X_1 \times X_2$ that satisfies (2.5).

18. (Continuation.) Let $\prec$ be defined as in the preceding exercise, except that $X_1$
is the rationals between 0 and 1 inclusive and $X_2 = \{1, 2, \ldots\}$, the positive integers.
Theorem 2.2 says that there is a real-valued $u$ on $X = X_1 \times X_2$ that satisfies (2.5).
Can you write out an explicit formula for $u$ on $X_1 \times X_2$ that satisfies (2.5)? If not,
explain why not.

19. Prove Theorem 2.2 when $X/\sim$ is finite.

20. Let $A \subseteq B$ mean that $A$ is a subset of $B$ and $A = B$ if and only if $A \subseteq B$ and
$B \subseteq A$. $A \cup B$ is the set of all elements in $A$ or in $B$, and $A \cap B$ is the set of all
elements in both $A$ and $B$. Let $\emptyset$ denote the empty set (set with no elements).
With $Y$ a set, let $\Delta = \{(x, x) : x \in Y\}$; if $R$ is a binary relation on $Y$ let $R' =
\{(y, x) : (x, y) \in R\}$; if $R$ and $S$ are binary relations on $Y$, let $RS = \{(x, z) : xRy$ and
$ySz$ for some $y \in Y\}$. Express p1 through p11 of the chapter in terms of these
definitions. For example, p1 can be written as $\Delta \subseteq R$.
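
The three constructions in Exercise 20 translate directly into operations on sets of ordered pairs. The sketch below is a hedged illustration only: the function names `diagonal`, `converse`, and `compose` and the sample relation are hypothetical, and the code does not solve the exercise.

```python
def diagonal(Y):
    """The set of pairs (x, x) for x in Y."""
    return {(x, x) for x in Y}

def converse(R):
    """R' = {(y, x) : (x, y) in R}."""
    return {(y, x) for (x, y) in R}

def compose(R, S):
    """RS = {(x, z) : xRy and ySz for some y}."""
    return {(x, z) for (x, y) in R for (y2, z) in S if y == y2}

# Hypothetical relation on Y = {1, 2, 3}.
Y = {1, 2, 3}
R = {(1, 2), (2, 3)}

print(converse(R))
print(compose(R, R))
# Reflexivity in the form used in the exercise: the diagonal is a subset of R.
print(diagonal(Y) <= R)
```

Python's `<=` on sets is the subset test, so the last line prints `False` for this relation, which is not reflexive.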

21. (Continuation.) Verify that when the given sets are binary relations on a set
$Y$, then

a. $\Delta' = \Delta$; $\emptyset' = \emptyset$ [$\emptyset$ is the empty binary relation]

b. $(A \cup B)' = A' \cup B'$; $(A \cap B)' = A' \cap B'$

c. $(AB)C = A(BC)$

d. $A\emptyset = \emptyset A = \emptyset$

e. $A\Delta = \Delta A = A$

f. $A \subseteq B$ and $C \subseteq D$ imply $AC \subseteq BD$.

(See  Chipman  (1960)  for  additional  material  of  this  kind.) 




Chapter  3 


UTILITY  THEORY  FOR 
UNCOUNTABLE  SETS 


This  chapter  extends  the  theory  of  preference-preserving  utility  functions  to 
include  uncountable  sets.  A  new  condition  of  order  denseness  is  used  for  this 
purpose.  After  proving  basic  theorems  for  weak  orders  and  strict  partial 
orders we shall consider preferences on subsets of n-dimensional Euclidean
space.  The  chapter  concludes  with  a  discussion  of  continuous  utility  functions. 

An  uncountable  set  is  a  set  that  is  not  countable:  it  is  neither  finite  nor 
denumerable.  The  following  examples  introduce  some  other  new  terms. 

1. The set of all real numbers is uncountable. This set, denoted by Re or
$E^1$, is one-dimensional Euclidean space. The intervals of numbers $[a, b] =
\{x : a \le x \le b\}$, $[a, b) = \{x : a \le x < b\}$, $(a, b] = \{x : a < x \le b\}$, and
$(a, b) = \{x : a < x < b\}$ are uncountable when $a < b$. $[a, b]$ is a closed
interval; $(a, b)$ is an open interval. $(a, b)$ is also used to denote an ordered
pair of elements. The context should clarify the usage.

2. The set $\{(x_1, x_2, \ldots, x_n) : x_i \in$ Re for $i = 1, \ldots, n\}$, denoted as Re$^n$ or
$E^n$ and called n-dimensional Euclidean space, is uncountable. $E^2$ is the real
plane. In the vector $(x_1, x_2, \ldots, x_n)$, the $i$th component is $x_i$.

3. The set $\{(x_1, x_2, \ldots) : x_i \in \{0, 1\}$ for $i = 1, 2, \ldots\}$ is uncountable.
Although $\{(x_1, x_2, \ldots, x_n) : x_i \in \{0, 1\}$ for $i = 1, 2, \ldots, n\}$ is finite for each
$n$, the given denumerable-dimensional set is uncountable (and not
denumerable). On the other hand, $\{(x_1, x_2) : x_i \in \{1, 2, \ldots\}$ for $i = 1, 2\}$ is
denumerable.


3.1  THE  DENSENESS  AXIOM  AND  WEAK  ORDERS 

We shall now extend Theorem 2.2 to cover the case where $X/\sim$ may not
be countable. To do this we shall introduce an assumption concerning the
concept of order denseness.




Definition 3.1. Let $R$ be a binary relation on a set $Y$. Then $Z \subseteq Y$ is
$R$-order dense in $Y$ if and only if, whenever $xRy$ and $x$ and $y$ are in $Y$ but not
$Z$, there is a $z \in Z$ such that ($xRz$, $zRy$).

Since there is a rational number between any two distinct real numbers,
the countable set of rational numbers is $<$-order dense in Re. For the
following theorem, $\prec'$ on $X/\sim$ is defined by (2.4).
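
The denseness of the rationals can be made concrete. The sketch below is a minimal illustration under the stated assumptions: the function name `rational_between` is hypothetical, and with floating-point inputs the arithmetic is only as exact as the floats themselves (exact `Fraction` inputs avoid that caveat).

```python
from fractions import Fraction
import math

def rational_between(a, b):
    """Return a Fraction q with a < q < b, given real numbers a < b."""
    if not a < b:
        raise ValueError("need a < b")
    n = math.floor(1 / (b - a)) + 1   # n > 1/(b - a), so 1/n < b - a
    k = math.floor(a * n) + 1         # smallest integer k with k/n > a
    return Fraction(k, n)             # k/n <= a + 1/n < b

# Two irrational-looking endpoints (as floats) and two exact endpoints.
q1 = rational_between(math.sqrt(2), 1.5)
q2 = rational_between(Fraction(1, 3), Fraction(1, 2))
print(q1, q2)
```

The choice of $n$ makes the grid of multiples of $1/n$ finer than the gap $b - a$, so at least one grid point falls strictly inside the interval.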

THEOREM 3.1. There is a real-valued function $u$ on $X$ such that

$x \prec y \iff u(x) < u(y)$, for all $x, y \in X$, (3.1)

if and only if $\prec$ on $X$ is a weak order and there is a countable subset of $X/\sim$
that is $\prec'$-order dense in $X/\sim$.

Unfortunately, the countable order denseness condition does not have a
simple, intuitive interpretation. To see how this condition can fail, suppose
$X =$ Re$^2$ with $\prec$ the lexicographic order

$(x_1, x_2) \prec (y_1, y_2) \iff x_1 < y_1$ or $(x_1 = y_1, x_2 < y_2)$.

Then $X/\sim \; = \{\{x\} : x \in X\}$, so that $\{x\} \prec' \{y\} \iff x \prec y$. With $x_1$ fixed it takes
a denumerable subset of Re to obtain a $\prec$-order dense subset on $\{x_1\} \times$ Re.
But there is an uncountable number of such $x_1$ and it follows that no countable
subset of Re$^2$ is $\prec$-order dense in Re$^2$.

For another example let $X = [-1, 1]$. The absolute value of $x$, written
$|x|$, is defined by $|x| = x$ if $x \ge 0$, $|x| = -x$ if $x < 0$. Define $\prec$ on $X$ by

$x \prec y \iff |x| < |y|$ or $(|x| = |y|, x < y)$.

Suppose $Y$ is $\prec$-order dense in $[-1, 1]$. With $x \in (0, 1]$, $-x \prec x$ and there
is no $y$ with $|y| \ne |x|$ such that $-x \prec y \prec x$. Hence either $-x$ or $x$ must be
in $Y$ for each $x \in (0, 1]$. Thus, every $\prec$-order dense subset $Y$ of $[-1, 1]$
contains a subset that is in one-to-one correspondence with $(0, 1]$, which is
uncountable.
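
The order in this second example is the lexicographic order on the pair $(|x|, x)$, which makes the "no element between $-x$ and $x$" claim easy to see on a finite sample. The sketch below is illustrative only: the grid of quarter steps and the helper names are assumptions, and a finite grid can show the pattern but cannot by itself prove the statement for all of $[-1, 1]$.

```python
from fractions import Fraction

def key(x):
    """Sort key realizing x ≺ y iff |x| < |y| or (|x| = |y| and x < y)."""
    return (abs(x), x)

def prefers(x, y):
    """True when x ≺ y in the order of the example."""
    return key(x) < key(y)

# Hypothetical sample of X = [-1, 1]: multiples of 1/4.
grid = [Fraction(k, 4) for k in range(-4, 5)]
ordered = sorted(grid, key=key)
print(ordered)

# For each positive x on the grid, -x is the immediate predecessor of x.
adjacent = all(
    ordered.index(x) == ordered.index(-x) + 1 for x in grid if x > 0
)
print(adjacent)
```

Any $y$ strictly between $-x$ and $x$ in this order would need $|y| = |x|$, which leaves only $-x$ and $x$ themselves.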

Proof  of  Theorem  3.1 

Before proving the theorem, several additional notions will be defined. If
$A$ and $B$ are sets, the union $A \cup B$ of $A$ and $B$ is the set of all elements in $A$
or $B$. The relative difference $A - B$ is the set of all elements in $A$ but not $B$.
Let $A$ be a set of numbers all of which are less than some number not in $A$.
Then the least upper bound or supremum of $A$ is the smallest number that is as
large as every number in $A$:

sup $A$ = smallest $y$ such that $x \le y$ for all $x \in A$.

If all numbers in $A$ exceed some number not in $A$ then the greatest lower
bound or infimum of $A$ is the largest number that is as small as every number
in $A$:

inf $A$ = largest $y$ such that $y \le x$ for all $x \in A$.

For example, sup $\{1, 2, 3\} = 3$, inf $\{1, 2, 3\} = 1$, sup $(0, 1) = 1$ and
inf $(0, 1) = 0$. In the last two cases sup and inf are not in $A$.

Proof of Necessity. Let (3.1) hold. Then $\prec$ on $X$ must be a weak order,
and $\prec'$ on $X/\sim$ is a strict order, with $a \prec' b \iff u(a) < u(b)$, where $u(a) =
u(x)$ whenever $x \in a$. Let $\mathcal{C}$ be the denumerable set of closed intervals in Re
with distinct, rational endpoints. For each $I \in \mathcal{C}$ that contains some $u(a)$ for
$a \in X/\sim$, select one such $a$. Let $A$ be the subset of $X/\sim$ thus selected. $A$ is
countable. Next, let

$K = \{(b, c) : b, c \in X/\sim - A$, $b \prec' c$, $b \prec' a \prec' c$ for no $a \in A\}$.

If $(b, c) \in K$, then $b \prec' a \prec' c$ for no $a \in X/\sim$, for otherwise there would be
a $d \in A$ with $b \prec' d \prec' c$ since for every point in the open interval $(u(b), u(c))$
there is an $I \in \mathcal{C}$ that includes the point with $I \subset (u(b), u(c))$. Hence no two
open intervals $(u(b), u(c))$ for $(b, c) \in K$ overlap, so that $K$ must be countable.
Therefore,

$B = \{b : b \in X/\sim - A$, there is a $c \in X/\sim - A$ such that $(b, c) \in K$ or $(c, b) \in K\}$

is countable and hence $A \cup B$ is countable. Moreover, if $b, c \in X/\sim -
A \cup B$ and $b \prec' c$, then there is an $a \in A \cup B$ such that $b \prec' a \prec' c$. Thus
the countable order denseness condition is necessary for (3.1). ♦

Proof of Sufficiency. We assume that $\prec$ on $X$ is a weak order and will
work with the strict order $\prec'$ on $X/\sim$. We shall assume that $A$ includes the
least and/or most preferred ($\prec'$) elements in $X/\sim$, if such exist, and that $A$
is countable and is $\prec'$-order dense in $X/\sim$. Let

$B = \{b : b \in X/\sim - A$, either $\{a : a \in A, b \prec' a\}$ has a least preferred
element $a_b$ or $\{c : c \in A, c \prec' b\}$ has a most preferred element $c_b\}$.

With $b \in X/\sim - A$, $\{a : a \in A, b \prec' a\}$ and $\{c : c \in A, c \prec' b\}$ are two disjoint
subsets of $A$ whose union equals $A$. It follows that a given $a \in A$ can be an $a_b$
for at most one $b \in X/\sim - A$, and that a given $c \in A$ can be a $c_b$ for at most
one $b \in X/\sim - A$. Hence $B$ is countable and therefore

$C = A \cup B$

is countable. Moreover,

1. There is no least preferred $a \in \{a : a \in C, b \prec' a\}$ for any $b \in X/\sim - C$.

2. There is no most preferred $c \in \{c : c \in C, c \prec' b\}$ for any $b \in X/\sim - C$.



For proof, suppose (1) is false and $a_b$ is the least preferred element in
$\{a : a \in C, b \prec' a\}$ for some $b \in X/\sim - C$. Then $a_b$ cannot be in $A$, for
otherwise $b \in B$. But then $c \prec' b \prec' a_b \prec' a$ for all $c \in \{c : c \in A, c \prec' b\}$ and
all $a \in \{a : a \in A, b \prec' a\}$ and there is no element in $A$ between $b$ and $a_b$, in
violation of the order denseness assumption. Hence (1) is true and, by a
symmetric proof, (2) is true.

By the proof of Theorem 2.2 there is a real-valued function $u$ on $C$ such
that $a \prec' c \iff u(a) < u(c)$, for all $a, c \in C$. For each $b \in X/\sim - C$ let

$u^b = \{u(a) : a \in C, b \prec' a\}$

$u_b = \{u(c) : c \in C, c \prec' b\}$

and set

$u(b) = \tfrac{1}{2}(\sup u_b + \inf u^b)$, (3.2)

where, since $u(c) < u(a)$ for all $u(c) \in u_b$ and $u(a) \in u^b$, $\sup u_b \le \inf u^b$. From (2)
and (1) above it follows that for each $b \in X/\sim - C$,

$u(c) < \sup u_b$, for all $u(c) \in u_b$

$\inf u^b < u(a)$, for all $u(a) \in u^b$.

Hence $u(c) < u(b) < u(a)$ for all $c \in \{c : c \in C, c \prec' b\}$ and all $a \in \{a : a \in C,
b \prec' a\}$. Hence $u(b) \ne u(a)$ when $b \in X/\sim - C$ and $a \in C$, and the extension
of $u$ by (3.2) preserves the ordering of the $b \in X/\sim - C$ and the $a \in C$.

Suppose then that $b, c \in X/\sim - C$. If $b \prec' c$ then $b \prec' a \prec' c$ for some
$a \in C$ so that $u(b) < u(a)$ and $u(a) < u(c)$ and hence $u(b) < u(c)$. Conversely,
if $u(b) < u(c)$, there is, by definition of supremum and (1), a $u(a) \in u^b$ such
that $u(b) < u(a) < u(c)$, which yields $b \prec' a$ and $a \prec' c$ and therefore
$b \prec' c$ by transitivity. Hence, for all $a, b \in X/\sim$, $a \prec' b \iff u(a) < u(b)$.
Defining $u(x) = u(a)$ when $x \in a$, (3.1) follows. ♦
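
The extension step (3.2) places each new element halfway between the utilities just below it and just above it. The sketch below mimics that step under strong simplifying assumptions: `C` is a small finite list standing in for the countable set of the proof, so sup and inf reduce to `max` and `min`, and the names `extend_utility` and `below` are hypothetical. It also assumes, as the proof arranges, that `C` contains elements on both sides of `b`.

```python
from fractions import Fraction

def extend_utility(b, C, u, below):
    """Midpoint rule in the spirit of (3.2) for an element b not in C.

    below(p, q) means p is less preferred than q.
    """
    lower = [u[c] for c in C if below(c, b)]   # finite stand-in for u_b
    upper = [u[a] for a in C if below(b, a)]   # finite stand-in for u^b
    return (max(lower) + min(upper)) / 2

def less_than(p, q):
    return p < q

# Hypothetical data: C is three points of [0, 1] with u the identity on C.
C = [Fraction(0), Fraction(1, 2), Fraction(1)]
u = {c: c for c in C}
b = Fraction(1, 3)

u_b = extend_utility(b, C, u, less_than)
print(u_b)
```

With a finite `C` the upper set does have a least element, so conditions (1) and (2) of the proof fail here; the sketch only shows how the midpoint keeps the new value strictly between its neighbours.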

The  above  proof  is  patterned  after  outlines  in  Birkhoff  (1948,  p.  32)  and 
Luce  and  Suppes  (1965,  pp.  263-264).  Our  proof  is  similar  also  to  Debreu’s 
proof  of  his  Lemma  II  (1954,  pp.  161-162). 

3.2  PREFERENCE  AS  A  STRICT  PARTIAL  ORDER 

We shall now consider an appropriate generalization of Theorem 2.5 for
strict partial orders. Throughout this section $\prec^*$ on $X/\approx$ is defined as in
(2.7) with $\approx$ as in (2.6).

THEOREM 3.2. Suppose $\prec$ on $X$ is a strict partial order and there is a
countable subset of $X/\approx$ that is $\prec^*$-order dense in $X/\approx$. Then there is a
real-valued function $u$ on $X$ such that

$x \prec y \Rightarrow u(x) < u(y)$, for all $x, y \in X$, (3.3)

$x \approx y \Rightarrow u(x) = u(y)$, for all $x, y \in X$. (3.4)

In this case the denseness condition is not necessary for (3.3) and (3.4).
Suppose for example that $X =$ Re and define

$x \prec y \iff x < y$ and $y = x + n$ for some positive integer $n$.

Then $u(x) = x$ satisfies (3.3) and (3.4), and $X/\approx \; = \{\{x\} : x \in X\}$. If $Z \subseteq X$
is countable then there is an $x$ such that neither $x$ nor $x + 1$ is in $Z$. But with
$x \prec x + 1$, there is no $z \in Z$ such that $x \prec z \prec x + 1$. It follows that there
is no countable subset of $X/\approx$ that is $\prec^*$-order dense in $X/\approx$.
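
The relation in this counterexample is easy to experiment with. The sketch below is a hedged illustration on a finite sample: the grid of half-integers and the helper name `prefers` are assumptions, and exact fractions are used so that "differs by a positive integer" is tested without rounding.

```python
from fractions import Fraction
from itertools import product

def prefers(x, y):
    """x ≺ y iff y = x + n for some positive integer n."""
    d = Fraction(y) - Fraction(x)
    return d > 0 and d.denominator == 1

# Hypothetical sample of X = Re: 0, 1/2, 1, ..., 4.
points = [Fraction(k, 2) for k in range(0, 9)]

# u(x) = x satisfies (3.3) on the sample: x ≺ y implies x < y.
monotone = all(x < y for x, y in product(points, repeat=2) if prefers(x, y))

# The relation is transitive on the sample.
transitive = all(
    prefers(x, z)
    for x, y, z in product(points, repeat=3)
    if prefers(x, y) and prefers(y, z)
)

# Nothing lies strictly between x and x + 1 in the sense of ≺.
between = [z for z in points if prefers(Fraction(1, 2), z) and prefers(z, Fraction(3, 2))]
print(monotone, transitive, between)
```

The empty list reflects the argument in the text: a point $z$ with $x \prec z \prec x + 1$ would have to differ from both ends by positive integers, which is impossible.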

Our  proof  of  Theorem  3.2  is  based  on  an  ingenious  proof  of  a  somewhat 
more  general  theorem  given  by  Richter  (1966). 

Proof of Theorem 3.2. Let the hypotheses of the theorem hold. By
Theorem 2.3, $\prec^*$ on $X/\approx$ is a strict partial order. Let $A$ be a countable subset
of $X/\approx$ that is $\prec^*$-order dense in $X/\approx$. By Theorem 2.4 there is a strict order
$\prec^0$ on $X/\approx$ that includes $\prec^*$: $a \prec^* b \Rightarrow a \prec^0 b$. Define a binary relation $E$
on $X/\approx$ as follows:

$aEb \iff a = b$ or ($a, b \notin A$ and $a \prec^0 c \prec^0 b$ or $b \prec^0 c \prec^0 a$ for no $c \in A$).

Then $E$ is obviously reflexive and symmetric and is in fact an equivalence on
$X/\approx$. For transitivity suppose ($aEb$, $bEc$) with $a \ne b \ne c \ne a$ (to avoid the
trivial cases). If ($a \prec^0 b$, $b \prec^0 c$) or ($a \prec^0 b$, $c \prec^0 b$) or ($b \prec^0 a$, $b \prec^0 c$) or
($b \prec^0 a$, $c \prec^0 b$), which are the only four possibilities, then there is no $d \in A$
such that $a \prec^0 d \prec^0 c$ or $c \prec^0 d \prec^0 a$. Hence $aEc$.

Let $r$, $s$, and $t$ be equivalence classes in the set of such classes in $X/\approx$ under
$E$. That is, $r \in (X/\approx)/E$. Define $\prec^1$ on these classes as follows:

$r \prec^1 s \iff r \ne s$ and $a \prec^0 b$ for some (and thus for all) $a \in r$, $b \in s$.

Since $\prec^0$ on $X/\approx$ is a strict order and $E$ on $X/\approx$ is an equivalence, $\prec^1$ on
$(X/\approx)/E$ is a strict order. Moreover, $B = \{r : r \in (X/\approx)/E$ and $a \in r$ for some
$a \in A\}$ is $\prec^1$-order dense in $(X/\approx)/E$. For suppose $r$, $s$ are not in $B$ and $r \prec^1 s$.
Then, with $a \in r$ and $b \in s$, $a \prec^0 b$ and $a, b \notin A$. Since not $aEb$ there must be
a $c \in A$ such that $a \prec^0 c \prec^0 b$. With $c \in t$ it follows that $t \in B$ and $r \prec^1 t \prec^1 s$.

It then follows from the proof of Theorem 3.1 that there is a real-valued
function $f$ on $(X/\approx)/E$ such that

$r \prec^1 s \iff f(r) < f(s)$, for all $r, s \in (X/\approx)/E$. (3.5)

Suppose that, with $a \in r$ and $b \in s$, $a \prec^* b$. Then $a \prec^0 b$. Therefore either
$r = s$ or $r \prec^1 s$. If either $a$ or $b$ is in $A$ then $r \ne s$ since $a \ne b$ and hence not
$aEb$. If $a, b \notin A$ and $r = s$ then $a \prec^0 c \prec^0 b$ for no $c \in A$, which is false since
$A$ is $\prec^*$-order dense in $X/\approx$ and hence if $a \prec^* b$ and $a, b \notin A$ then $a \prec^*
c \prec^* b$ (and thus $a \prec^0 c \prec^0 b$) for some $c \in A$. Therefore $a \prec^* b \Rightarrow r \prec^1 s$.
Defining $u(a) = f(r)$ when $a \in r$ it follows from (3.5) that if $a \prec^* b$ then
$u(a) < u(b)$. Defining $u(x) = u(a)$ when $x \in a$ and observing that if $x \prec y$
and ($x \in a$, $y \in b$) then $a \prec^* b$, it follows that $x \prec y \Rightarrow u(x) < u(y)$. It is
clear also that $u(x) = u(y)$ when $x, y \in a$. ♦


3.3 PREFERENCES ON Re$^n$

Preferences  in  many  decision  situations  are  influenced  by  multiple  factors. 
Hence  a  large  part  of  our  study  will  focus  on  sets  whose  elements  are  n-tuples. 
When the components of the n-tuples are real numbers, the n-tuples are called
vectors.

This section looks at the special case where $X$ equals Re$^n$ or is a rectangular
subset of Re$^n$, by which is meant the Cartesian product of $n$ real intervals,
including perhaps infinite intervals such as $(a, \infty)$, the set of all numbers
greater than $a$, and $(-\infty, \infty) =$ Re.

When $(x_1, \ldots, x_n)$ and $(y_1, \ldots, y_n)$ are vectors in Re$^n$ and $\alpha$, $\beta$ are scalars
(real numbers), we define multiplication by scalars and vector addition by

$\alpha x + \beta y = (\alpha x_1, \ldots, \alpha x_n) + (\beta y_1, \ldots, \beta y_n)$
$= (\alpha x_1 + \beta y_1, \ldots, \alpha x_n + \beta y_n)$. (3.6)
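
Definition (3.6) is a componentwise operation, which the following sketch spells out for vectors stored as tuples. The function name `combine` is a hypothetical choice made for this illustration.

```python
def combine(alpha, x, beta, y):
    """Return alpha*x + beta*y computed component by component, as in (3.6)."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same number of components")
    return tuple(alpha * xi + beta * yi for xi, yi in zip(x, y))

# Example in Re^2: 2*(1, 2) + 3*(0, 1).
print(combine(2, (1, 2), 3, (0, 1)))
```

Choosing `beta = 1 - alpha` with `alpha` between 0 and 1 gives the mixtures of two vectors that appear later in this section.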

After  illustrating  a  utility  function  for  increasing  preferences  in  two 
dimensions  we  shall  consider  some  formal  theory  for  such  cases. 

Example 

We consider preferences of the president of a company on a set of
two-dimensional vectors $(x_1, x_2)$ where $x_1$ denotes net profit for the coming year
and $x_2$ denotes the company's market share for the coming year. $X_1 =
[-\$5$ million, $\$5$ million$]$ and $X_2 = [10\%, 30\%]$. A utility surface that
might reflect the president's preferences is shown in Figure 3.1. If $\prec$ on
$X_1 \times X_2$ is a weak order and (3.1) holds then all $(x_1, x_2) \in X_1 \times X_2$ with equal
utility constitute an element in $X/\sim$. These equivalence classes are variously
called indifference curves, trade-off curves, indifference loci, isoutility contours,
and so forth. The family of indifference curves in the plane constitutes an
indifference map. Two curves of the indifference map are illustrated in the
figure.

If indifference were not transitive in this example then the preceding
interpretation for an element in $X/\sim$ would not apply.


Figure 3.1 Unidimensional utilities on a two-dimensional space.

Increasing  Preferences  with  Weak  Orders 

Let $X_i$, $i = 1, 2, \ldots, n$ be nonempty sets. Their Cartesian product is
$X_1 \times X_2 \times \cdots \times X_n = \{(x_1, x_2, \ldots, x_n) : x_i \in X_i$ for $i = 1, 2, \ldots, n\}$. In
this subsection we assume that each $X_i$ is an interval of real numbers, so that
$X = X_1 \times X_2 \times \cdots \times X_n$ is a rectangular subset of Re$^n$. Elements in $X_i$ could be
amounts of money allocated to activity $i$ or earned in year $i$, or they could be
amounts of commodity $i$ purchased during a fixed time period, and so forth.

With $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$, we define $x < y \iff x \ne y$
and $x_i \le y_i$ for $i = 1, \ldots, n$.

THEOREM 3.3. Suppose that $X$ is a rectangular subset of Re$^n$ and that the
following hold throughout $X$:

1. $\prec$ on $X$ is a weak order,

2. $x < y \Rightarrow x \prec y$,

3. $(x \prec y, y \prec z) \Rightarrow \alpha x + (1 - \alpha)z \prec y$ and $y \prec \beta x + (1 - \beta)z$ for some
$\alpha, \beta \in (0, 1)$.

Then there is a real-valued function $u$ on $X$ that satisfies (3.1).

The second condition (monotonicity, nonsaturation, nonsatiety,
dominance, etc.) states that preference increases with any increase in quantity.
Condition 3 is an Archimedean condition that will be used to establish a
countable order dense subset. For the third condition to hold it may be
necessary in some cases to have $\alpha$ very near to 1 and $\beta$ very near to zero.
In proving the theorem we shall first prove the following lemma.




LEMMA 3.1. The hypotheses of Theorem 3.3 imply that if $x, y, z \in X$ and
$x < y < z$, then $y \sim \alpha x + (1 - \alpha)z$ for exactly one $\alpha \in (0, 1)$.

Proof. If $y \sim \alpha x + (1 - \alpha)z$ for no $\alpha \in (0, 1)$, it follows from the
hypotheses that there is a $\beta \in (0, 1)$ such that either

$y \prec \alpha x + (1 - \alpha)z$ for all $\alpha \le \beta$ (3.7)

$\alpha x + (1 - \alpha)z \prec y$ for all $\alpha > \beta$ (3.8)

or

$y \prec \alpha x + (1 - \alpha)z$ for all $\alpha < \beta$ (3.9)

$\alpha x + (1 - \alpha)z \prec y$ for all $\alpha \ge \beta$. (3.10)

We consider the latter case. By (3.10) and the hypotheses, $\beta x + (1 - \beta)z \prec
y \prec z$. Hence, by condition 3 of Theorem 3.3, there is an $\alpha \in (0, 1)$ such that
$\alpha[\beta x + (1 - \beta)z] + (1 - \alpha)z \prec y$, or $\alpha\beta x + (1 - \alpha\beta)z \prec y$. But since
$\alpha\beta < \beta$, (3.9) says that $y \prec \alpha\beta x + (1 - \alpha\beta)z$, a contradiction. Hence (3.9)
and (3.10) can't hold. A similar proof shows that (3.7) and (3.8) can't hold.
Hence $y \sim \alpha x + (1 - \alpha)z$ for some $\alpha \in (0, 1)$. If $y \sim \alpha_1 x + (1 - \alpha_1)z$ and
$y \sim \alpha_2 x + (1 - \alpha_2)z$ then $\alpha_1 x + (1 - \alpha_1)z \sim \alpha_2 x + (1 - \alpha_2)z$ by the
transitivity of $\sim$, which can only be true if $\alpha_1 = \alpha_2$: for if $\alpha_1 < \alpha_2$ then $\alpha_2 x +
(1 - \alpha_2)z \prec \alpha_1 x + (1 - \alpha_1)z$ since $x < z$. ♦
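
When preferences happen to be given by a known increasing utility function, the unique weight of Lemma 3.1 can be located numerically by bisection. The sketch below is an illustration under that added assumption and is not part of the lemma's proof: the utility $u(x_1, x_2) = x_1 x_2$ on the positive quadrant, the sample vectors, and the names `mix` and `indifference_weight` are all hypothetical.

```python
def mix(alpha, x, z):
    """Return alpha*x + (1 - alpha)*z componentwise."""
    return tuple(alpha * xi + (1 - alpha) * zi for xi, zi in zip(x, z))

def indifference_weight(u, x, y, z, tol=1e-12):
    """Bisect for alpha in (0, 1) with u(alpha*x + (1-alpha)*z) = u(y).

    Assumes u is continuous and increasing and u(x) < u(y) < u(z),
    so the mixture's utility decreases as alpha moves from 0 to 1.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if u(mix(mid, x, z)) > u(y):
            lo = mid   # mixture still preferred to y: shift weight toward x
        else:
            hi = mid
    return (lo + hi) / 2

def product_utility(v):
    # Hypothetical utility, increasing in each component for positive vectors.
    return v[0] * v[1]

alpha = indifference_weight(product_utility, (1.0, 1.0), (2.0, 2.0), (3.0, 3.0))
print(round(alpha, 9))
```

Here the mixture is $(3 - 2\alpha, 3 - 2\alpha)$, whose utility equals that of $(2, 2)$ exactly when $\alpha = 1/2$, which is the value the bisection approaches.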

Proof of Theorem 3.3. In view of Theorem 3.1 we need to show that $X/\sim$
contains a countable subset that is $\prec'$-order dense in $X/\sim$.

Let $Y_i$ be the set of all rational numbers plus any finite end point at any
closed end of $X_i$ (if such exist). Let $Z_i = X_i \cap Y_i$. $Z_i$ is countable. Let
$W_i = \{\alpha x_i + (1 - \alpha)y_i : \alpha$ is a rational number in $[0, 1]$ and $x_i, y_i \in Z_i\}$. $W_i$
is a countable set. Let $W = W_1 \times W_2 \times \cdots \times W_n$. $W$ is countable. Let $A$
consist of all elements in $X/\sim$ that contain one or more elements in $W$. $A$ is
countable since any $x \in W$ is in exactly one $a \in X/\sim$. Suppose $a, b \in X/\sim - A$
with $a \prec' b$. We need to show that there is a $c \in A$ such that $a \prec' c \prec' b$.
To do this it will suffice to show that when $x, y \in X - W$ and $x \prec y$ then
there is a $z \in W$ such that $x \prec z \prec y$. We consider two cases as follows.

Case  l:  x  <  y.  Then  there  are  zl,  z2  eZx  x  •  •  ■  x  Zn  such  that  z1  <  x 
and  y  <  z2.  Lemma  3.1,  weak  order,  and  condition  2  of  the  theorem  imply 
that  there  are  a,  /?  with  0  <  a  <  <  1  such  that  x~/? z1  +  (1  —  /?)z8, 

i/ ~  azl  4-  (1  —  a)z2.  Let  y  be  any  rational  number  in  the  interval  (a,  /3). 
Then,  by  weak  order  and  conditior  2,  x  <  yz1  +  (l  —  y)z2  <  y.  Since 
zl,  z2  e  Zj  x  •  •  •  x  Z„  and  y  is  rational,  yz1  +  (l  —  y)z2  e  IT. 

Core  2:  x  <y  is  false  (with  x,  y  e  X —  W  and  x  <  y).  Let  vt  = 
inf  {x,,  yj  and  w,  =  sup  {xi5  yt).  Then  v  <  x  <  wand  v  <y  <  w.  It  follows 
that  there  are  a,  /?  with  0  <  a  <  /?  <  1  such  that  x  fa  +  ( 1  —  /?)*’  and 




Utility  Theory  for  Uncountable  Sets 


$y \sim \alpha v + (1 - \alpha)w$. Since $\beta v + (1 - \beta)w < \alpha v + (1 - \alpha)w$, it follows
from the Case 1 proof that there is a $z \in W$ such that
$\beta v + (1 - \beta)w \prec z \prec \alpha v + (1 - \alpha)w$. Hence $x \prec z \prec y$. ♦

If preference decreases rather than increases as $x_i \in X_i$ increases, Theorem
3.3 can still be used after a change of variable from $x_i$ to $y_i = -x_i$.

Nondecreasing  Preferences  with  Strict  Partial  Order 

We conclude this section with a theorem that uses generally weaker conditions
than those of the preceding theorem. We shall use the non-negative
orthant $\{(x_1, \ldots, x_n) : x_i \ge 0$ for $i = 1, \ldots, n\}$ of Re^n. This is often used by
mathematical economists in investigations of consumer preference or consumer
choice. In this context the vectors are called commodity bundles.

$x \ll y$ means that $x_i < y_i$ for $i = 1, \ldots, n$.

THEOREM 3.4. Suppose that X is the non-negative orthant of Re^n and that
the following hold throughout X:

1. $\prec$ on X is a strict partial order,

2. $[(x \ll y, y \prec z)$ or $(x \prec y, y \ll z)] \Rightarrow x \prec z$,

3. $x \prec y \Rightarrow z \prec y$ for some z such that $x \ll z$.

Then there is a real-valued function u on X that satisfies (3.3).

The notion of nondecreasing preferences comes from condition 2. Irreflexivity
and condition 2 say that $x \ll y \Rightarrow$ not $y \prec x$: an increase in every
commodity does not decrease preference. Condition 3 says that if y is preferred
to x then increases (perhaps very slight) can be made in all components
of x, and y will still be preferred to the augmented x.

Proof of Theorem 3.4. Let the hypotheses hold. Define $x \prec^1 y \Leftrightarrow x \prec y$
or $x \ll y$. Conditions 1 and 2 imply that $\prec^1$ is a strict partial order. From
$\prec^1$ we can define $\sim^1$ and $\approx^1$ in the manner of (2.2) and (2.6). By Theorem
2.3, $\approx^1$ on X is an equivalence and $\prec^{1*}$ on $X/\approx^1$, defined in the manner of
(2.7), is a strict partial order. To show that there is a countable subset of
$X/\approx^1$ that is $\prec^{1*}$-order dense in $X/\approx^1$, it suffices to show that the set of
rational vectors in X (all components rational) is $\prec^1$-order dense in X.
Suppose then that x and y are not rational and $x \prec^1 y$. If $x \ll y$ then
$x \ll z \ll y$ for some rational z, and hence $x \prec^1 z \prec^1 y$. If $x \prec y$ then, by condition
3, $z \prec y$ for some z such that $x \ll z$. Then $x \ll t \ll z$ for some rational t. By
condition 2, $t \prec y$. Hence $x \prec^1 t \prec^1 y$. Therefore, by Theorem 3.2, there is
a real-valued function u on X such that $x \prec^1 y \Rightarrow u(x) < u(y)$. Then
$x \prec y \Rightarrow u(x) < u(y)$ since $x \prec y \Rightarrow x \prec^1 y$. ♦




3.4  CONTINUOUS  UTILITIES 

Continuity formalizes the intuitive notion that if two elements in X are
not very different then their utilities should be close together. The difference
between x and y can be thought of either in terms of their relative proximity
under $\prec$ or in terms of a structure for X that is related to $\prec$ in some way.

Part of the interest in continuity stems from the fact that, when continuity
holds, the utility function will attain a maximum value on a suitably restricted
subset of X. Suppose for example that X is the non-negative orthant of Re^n
and that an individual can spend his income $m \ge 0$ on the n commodities
whose unit prices are $p_1 > 0, p_2 > 0, \ldots, p_n > 0$. His choice is restricted to
$(p, m) = \{x : x \in X$ and $\sum_{i=1}^{n} p_i x_i \le m\}$. If $\prec$ satisfies the conditions of
Theorem 3.3 then there is a u that satisfies (3.1) and is continuous, and there
is an $x^* \in (p, m)$ that satisfies $px^* = m$ and $\sup\{u(x) : x \in (p, m)\} = u(x^*)$.
Or suppose that $\prec$ satisfies the conditions of Theorem 3.4. Then there is a u
that satisfies (3.3) and is upper semicontinuous and there is an $x^* \in (p, m)$
such that $\sup\{u(x) : x \in (p, m)\} = u(x^*)$. (See, for example, Thielman (1953,
p. 102).)
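The budget example can be made concrete with a small grid search. This is only a sketch under stated assumptions: the utility u(x) = sqrt(x_1) + sqrt(x_2), the prices (1, 2), the income 4, and the grid resolution are all hypothetical choices, and a grid search merely approximates the maximizer x* whose existence the text asserts. It does show the property px* = m on the grid: because u is strictly increasing, the best grid point spends the whole income.

```python
import math

def u(x1, x2):
    # Hypothetical utility (an assumption): strictly increasing in both commodities.
    return math.sqrt(x1) + math.sqrt(x2)

p1, p2, m = 1, 2, 4      # assumed unit prices and income
UNITS = 8                # quantities are restricted to multiples of 1/8

best = None
for a in range(0, m * UNITS // p1 + 1):
    for b in range(0, m * UNITS // p2 + 1):
        if p1 * a + p2 * b <= m * UNITS:          # budget constraint in grid units
            value = u(a / UNITS, b / UNITS)
            if best is None or value > best[0]:
                best = (value, a, b)

value, a, b = best
x_star = (a / UNITS, b / UNITS)                   # best bundle found on the grid
spent = p1 * x_star[0] + p2 * x_star[1]
```

Refining the grid moves x_star toward the exact maximizer; the grid is a device of this sketch and not of the theory.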

Definitions  for  Continuity 

To consider a general definition of continuity we require the following
notions. The union ($\cup$) of a set of subsets of X is the set of elements that
appear in at least one of the subsets. The intersection ($\cap$) of a set of subsets of
X is the set of elements that appear in every one of the subsets.

Definition 3.2. A topology $\mathcal{T}$ for a set X is a set of subsets of X such that

1. The empty set $\emptyset$ (which is always a subset of X) is in $\mathcal{T}$,

2. $X \in \mathcal{T}$,

3. The union of arbitrarily many sets in $\mathcal{T}$ is in $\mathcal{T}$,

4. The intersection of any finite number of sets in $\mathcal{T}$ is in $\mathcal{T}$.

If $\mathcal{T}$ is a topology for X, the pair $(X, \mathcal{T})$ is a topological space. By definition,
the subsets of X in $\mathcal{T}$ are called open sets.
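For a finite set the four parts of Definition 3.2 can be checked mechanically. The following sketch is an illustration only; the function name is_topology and the two sample families are assumptions introduced here.

```python
from itertools import combinations

def is_topology(X, family):
    # Check the four parts of Definition 3.2 for a finite set X.
    X = frozenset(X)
    T = {frozenset(s) for s in family}
    if any(not s <= X for s in T):
        return False                      # every member must be a subset of X
    if frozenset() not in T or X not in T:
        return False                      # parts 1 and 2
    # For a finite family, closure under pairwise unions and intersections
    # gives closure under all unions and all finite intersections (parts 3, 4).
    for A, B in combinations(T, 2):
        if A | B not in T or A & B not in T:
            return False
    return True

X = {1, 2, 3}
chain = [set(), {1}, {1, 2}, {1, 2, 3}]
broken = [set(), {1}, {2}, {1, 2, 3}]     # the union {1, 2} is missing
```

Here is_topology(X, chain) holds, while is_topology(X, broken) fails because part 3 is violated.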

The usual topology $\mathcal{U}$ for Re is the set of open intervals along with their
arbitrary unions and finite intersections. The relative usual topology for
$X \subseteq$ Re is $\{A \cap X : A \in \mathcal{U}\}$. When $X = [0, 2]$, the closed interval $[0, 2]$ is
an open set in the relative usual topology, but it is the only nonempty closed
interval in X that is an open set in the relative usual topology.

Definition 3.3. If $(X, \mathcal{T})$ is a topological space then a real-valued function
u on X is continuous in the topology $\mathcal{T}$ if and only if $A \in \mathcal{U} \Rightarrow \{x : x \in X,
u(x) \in A\} \in \mathcal{T}$.




Suppose $X = [0, 2]$ and $\mathcal{T} = \{A \cap [0, 2] : A \in \mathcal{U}\}$. Then the function
$u(x) = x$ for all $x \in X$ is continuous in $\mathcal{T}$, but the two-part function $f(x) = x$
for $x \in [0, 1]$ and $f(x) = x + 1$ for $x \in (1, 2]$ is not continuous because of its
gap or jump at $x = 1$. For example, $(1/2, 3/2) \in \mathcal{U}$ but $\{x : x \in [0, 2], f(x) \in
(1/2, 3/2)\} = (1/2, 1]$ is not in $\mathcal{T}$.
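The failure of continuity at x = 1 can be probed numerically. The sketch below is illustrative and proves nothing by itself: it checks that 1 belongs to the inverse image of (1/2, 3/2) under f while points arbitrarily close to 1 on the right do not, which is why that inverse image cannot be a relatively open set. The helper name in_preimage is an assumption.

```python
def f(x):
    # The two-part function from the text on X = [0, 2].
    return x if x <= 1 else x + 1

def in_preimage(x, lo=0.5, hi=1.5):
    # Is x in {x in [0, 2] : f(x) in (lo, hi)}?
    return 0 <= x <= 2 and lo < f(x) < hi

# x = 1 lies in the inverse image, but points just to its right do not,
# so no relatively open set containing 1 can stay inside the inverse image.
deltas = [10.0 ** -k for k in range(1, 10)]
right_neighbours_inside = [in_preimage(1 + d) for d in deltas]
```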

Necessary  and  Sufficient  Conditions  for  Continuity 

Assume that u on X satisfies (3.1) and is continuous in the topology $\mathcal{T}$.
For any $y \in X$ the sets $\{b : b < u(y)\}$ and $\{a : u(y) < a\}$ are open sets in $\mathcal{U}$:
hence $\{x : x \in X, x \prec y\}$ and $\{x : x \in X, y \prec x\}$ must be open sets in $\mathcal{T}$ for
every $y \in X$. Again, if u is continuous in the topology $\mathcal{T}$ and if $x \prec y$, so that
$u(x) < u(y)$, then there are open sets $A_x, A_y \in \mathcal{U}$ such that $u(x) \in A_x$ and
$a < u(y)$ for every $a \in A_x$, and $u(y) \in A_y$ and $u(x) < b$ for every $b \in A_y$: hence
there is an open set $\{z : u(z) \in A_x\}$ containing x such that $z \prec y$ for every z in
this set and there is an open set $\{w : u(w) \in A_y\}$ containing y such that $x \prec w$
for every w in this set.

The  foregoing  paragraph  sets  forth  two  necessary  conditions  for  continuity. 
Each  condition  is  also  sufficient  for  continuity. 

THEOREM 3.5. If $(X, \mathcal{T})$ is a topological space and there is a real-valued
function on X satisfying (3.1), then there is a real-valued function on X satisfying
(3.1) and continuous in the topology $\mathcal{T}$ if and only if

1. $\{x : x \in X, x \prec y\} \in \mathcal{T}$ and $\{x : x \in X, y \prec x\} \in \mathcal{T}$ for every $y \in X$, or

2. If $x, y \in X$ and $x \prec y$, then there are sets $T_x, T_y \in \mathcal{T}$ such that $x \in T_x$,
$y \in T_y$, $x' \prec y$ for every $x' \in T_x$ and $x \prec y'$ for every $y' \in T_y$.

Proof. The sufficiency of conditions 1 and 2 for continuity can be established
by showing that 2 implies 1 and that 1 implies that some u satisfying
(3.1) is continuous in $\mathcal{T}$.

Let y be any element in X. We show that condition 2 implies that $\{x : x \in X,
x \prec y\} \in \mathcal{T}$; a symmetric proof suffices for the other part of condition 1. If
$x \prec y$ for no $x \in X$ then $\{x : x \in X, x \prec y\} = \emptyset$, which is in $\mathcal{T}$. If $x \prec y$,
then by condition 2 there is a set $T_x \in \mathcal{T}$ containing x such that $x' \prec y$ for
all $x' \in T_x$. The union of all such $T_x$ is $\{x : x \in X, x \prec y\}$, which is in $\mathcal{T}$ by
part 3 of Definition 3.2.

To show that condition 1 implies that some u satisfying (3.1) is continuous
in $\mathcal{T}$, we follow Debreu (1964). Let u on X satisfy (3.1), with $u(X) =
\{u(x) : x \in X\}$. A gap of $u(X)$ is a nonempty interval I in Re such that no point
in $u(X)$ is in I and, with $a \in I$, $I = \{b : u(x) < b < u(y)$ for all $u(x) \in
\{u(x) : x \in X, u(x) < a\}$ and all $u(y) \in \{u(y) : y \in X, a < u(y)\}\}$. Debreu's basic
theorem (p. 285) asserts that, with u on X satisfying (3.1), there is a function




v on X that satisfies (3.1) such that all gaps of $v(X)$ are open intervals in $\mathcal{U}$.
Debreu's proof of this (pp. 285-289) will not be repeated here.

Let v on X satisfy (3.1) with all gaps of $v(X)$ open. With $a \in$ Re, let
$(-\infty, a) \in \mathcal{U}$ be the open interval of all numbers less than a. If $a \in v(X)$
with $a = v(y)$, then $\{x : v(x) \in (-\infty, a)\} = \{x : x \prec y\}$ which by condition 1
is in $\mathcal{T}$. If $a \notin v(X)$ and a is in a gap of $v(X)$, this gap has the form $(a_1, a_2)$
with $a \in (a_1, a_2)$ and $a_1, a_2 \in v(X)$: then $\{x : v(x) \in (-\infty, a)\} = \{x : x \prec z\}$
where $a_2 = v(z)$, and again by condition 1 this set is in $\mathcal{T}$. Finally, if $a \notin v(X)$
and it is in no gap of $v(X)$, then either

1. $a \le \inf v(X)$ so that $\{x : v(x) \in (-\infty, a)\} = \emptyset$, in $\mathcal{T}$, or

2. $\sup v(X) \le a$ so that $\{x : v(x) \in (-\infty, a)\} = X$, in $\mathcal{T}$, or

3. $a = \sup\{v(x) : x \in X, v(x) < a\}$ so that $\{x : v(x) \in (-\infty, a)\}$, the union
of all sets of the form $\{x : x \prec y, v(y) < a\}$, is in $\mathcal{T}$ since each set in the union
is in $\mathcal{T}$.

Thus $\{x : v(x) \in (-\infty, a)\} \in \mathcal{T}$ for every $a \in$ Re, and, by a symmetric
proof, $\{x : v(x) \in (b, \infty)\} \in \mathcal{T}$ for every $b \in$ Re. Since any bounded open
interval $(a, b)$ is the intersection of $(a, \infty) \in \mathcal{U}$ and $(-\infty, b) \in \mathcal{U}$,
$\{x : v(x) \in (a, b)\}$ is the intersection of two sets in $\mathcal{T}$ and hence is in $\mathcal{T}$. Since
any $A \in \mathcal{U}$ is formed by arbitrary unions and finite intersections of open
intervals in Re, the corresponding set $\{x : v(x) \in A\}$ can be formed in a similar
way from sets in $\mathcal{T}$ and hence is in $\mathcal{T}$. ♦

Contributions  to  continuity  in  the  context  considered  in  this  subsection 
have  been  made  also  by  Eilenberg  (1941),  Newman  and  Read  (1961),  and 
Rader  (1963).  Condition  2  of  Theorem  3.3  is  identical  to  Condition  B,  p.  160, 
in  Newman  and  Read.  Debreu  (1964)  includes  most  of  the  important  results 
in  this  area. 

Continuity of Increasing Utilities on Re^n

For Re^n we shall let $\mathcal{U}^n$ be the set of all open rectangles along with their
arbitrary unions and finite intersections. With X a rectangular subset of Re^n
this subsection examines the continuity of u on X with respect to the relative
topology $\{A \cap X : A \in \mathcal{U}^n\}$. The following theorem is slightly different from
very similar theorems on continuity discussed by Wold (1943), Wold and
Jureen (1953), Yokoyama (1956), Debreu (1959), and Newman and Read
(1961). The proof is similar to Yokoyama's.

THEOREM 3.6. The hypotheses of Theorem 3.3 imply that there is a real-valued
function on X that satisfies (3.1) and is continuous in the topology
$\{A \cap X : A \in \mathcal{U}^n\}$.

Proof.  Considering  Theorems  3.3  and  3.5  we  need  only  show  that 
condition  2  of  Theorem  3.5  holds  under  the  stated  hypotheses  when 




$\mathcal{T} = \{A \cap X : A \in \mathcal{U}^n\}$. With $\mathcal{T} = \{A \cap X : A \in \mathcal{U}^n\}$ and $x \prec y$ we show
that there is a $T_y \in \mathcal{T}$ such that $y \in T_y$ and $x \prec z$ for every $z \in T_y$. The
proof concerning $T_x$ is symmetric to this proof and is left to the reader.

With $x \prec y$, let $v_i = \inf\{x_i, y_i\}$ and if $v_i$ is greater than some element in
$X_i$ let $v_i'$ be any element in $X_i$ less than $v_i$: otherwise let $v_i' = v_i$. Then $v' \le v$
and $v \le x \prec y$. If $v' = x$, then $x \prec z$ for all $z \ne x$, $z \in X$, and any $T_y$ containing
y but not x suffices. Henceforth we assume that $v' < x$, so that $v' \prec x \prec y$,
implying by condition 3 of Theorem 3.3 that for some $\alpha \in (0, 1)$,
$x \prec \alpha v' + (1 - \alpha)y$. Now $\alpha v_i' + (1 - \alpha)y_i \le y_i$ for all i and strict inequality holds for
some i. Let $\epsilon > 0$ be smaller than the smallest $y_i - [\alpha v_i' + (1 - \alpha)y_i]$ for
which the difference is positive. Then $\alpha v_i' + (1 - \alpha)y_i < y_i - \epsilon$ for all i
for which $y_i - [\alpha v_i' + (1 - \alpha)y_i] > 0$. If $v_i' = y_i$, then any number less than $y_i$ is
not in $X_i$. Let $T_y' = (y_1 - \epsilon, y_1 + \epsilon) \times (y_2 - \epsilon, y_2 + \epsilon) \times \cdots \times (y_n - \epsilon,
y_n + \epsilon)$ and let $T_y = T_y' \cap X$. Then $T_y \in \mathcal{T}$ and for every $z \in T_y$,
$\alpha v' + (1 - \alpha)y < z$, so that $x \prec z$ for every $z \in T_y$. ♦

Upper  Semicontinuity  with  Strict  Partial  Order 

Definition 3.4. If $(X, \mathcal{T})$ is a topological space then a real-valued function
u on X is upper semicontinuous in the topology $\mathcal{T}$ if and only if

    $\{x : x \in X, u(x) < c\} \in \mathcal{T}$ for each real number c.    (3.11)

Lower semicontinuity is defined by (3.11) after < is changed to >. Given
a bounded, real-valued function f on X let u on X be defined by

    $u(x) = \inf\{\sup\{f(y) : y \in T\} : x \in T, T \in \mathcal{T}\}$.    (3.12)

For a given real number c suppose $u(x) < c$ for no x. Then $\{x : x \in X,
u(x) < c\} = \emptyset$, which is in $\mathcal{T}$. Suppose $u(x) < c$ for some $x \in X$. Then
there is a $T_x \in \mathcal{T}$ such that $x \in T_x$ and $\sup\{f(y) : y \in T_x\} < c$. It follows from
(3.12) that $u(y) < c$ for every $y \in T_x$. Hence, for each x such that $u(x) < c$
there is a $T_x \in \mathcal{T}$ such that $x \in T_x$ and $u(y) < c$ for every $y \in T_x$. The union
of all such $T_x$ will equal $\{x : x \in X, u(x) < c\}$, and this union is in $\mathcal{T}$ by
Definition 3.2(3). Hence u is upper semicontinuous in $\mathcal{T}$. We shall use this
observation in proving the following theorem.
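The construction (3.12) can be tried out on a small finite topological space, where the infimum and supremum become a minimum and a maximum. The sketch below is illustrative only: the topology, the function f, and the helper names envelope and is_upper_semicontinuous are assumptions introduced here.

```python
def envelope(f, topology):
    # u(x) = inf over open sets T containing x of sup of f on T, as in (3.12).
    points = set().union(*topology)
    return {
        x: min(max(f[y] for y in T) for T in topology if x in T)
        for x in points
    }

def is_upper_semicontinuous(u, topology):
    # (3.11): {x : u(x) < c} must be open for each real c. On a finite space
    # these sets change only at values of u, so finitely many c suffice.
    opens = {frozenset(T) for T in topology}
    levels = sorted(set(u.values())) + [max(u.values()) + 1]
    return all(frozenset(x for x in u if u[x] < c) in opens for c in levels)

topology = [set(), {1}, {1, 2}, {1, 2, 3}]   # hypothetical finite topology
f = {1: 1.0, 2: 5.0, 3: 3.0}                 # hypothetical bounded function
u = envelope(f, topology)
```

In this example f itself is not upper semicontinuous, since {x : f(x) < 5} = {1, 3} is not open, whereas the envelope u is, and f(x) <= u(x) holds at every point, the inequality used later in the proof of Theorem 3.7.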

THEOREM 3.7. The hypotheses of Theorem 3.4 imply that there is a real-valued
function on X that satisfies (3.3) and is upper semicontinuous in the
topology $\{A \cap X : A \in \mathcal{U}^n\}$.

Proof. As in the proof of Theorem 3.4 let $\prec^1$ on the non-negative orthant
of Re^n be defined as the union of $\prec$ and $\ll$. From that proof there is a
real-valued function f on X that satisfies $x \prec^1 y \Rightarrow f(x) < f(y)$. By a simple
monotonic transformation if necessary, we can suppose that f is bounded.




Then, with u defined as in (3.12), u is upper semicontinuous in the relative
topology $\mathcal{T} = \{A \cap X : A \in \mathcal{U}^n\}$.

It remains to show that $x \prec y \Rightarrow u(x) < u(y)$. Suppose $x \prec y$. Then, by
condition 3 of Theorem 3.4, $z \prec y$ for some $z \in X$ for which $x \ll z$. There is
then an open rectangle $T_x \in \mathcal{T}$ that contains x and has all elements $\ll z$, so
that $f(t) < f(z)$ for all $t \in T_x$, so that $u(x) \le f(z)$. Along with $f(z) < f(y)$
from $z \prec y$, and $f(y) \le u(y)$ by the definition of u, this gives $u(x) < u(y)$
as desired. ♦

I am indebted to Hurwicz and Richter (1970) for the approach used in this
proof.


3.5  SUMMARY 

When X is uncountable and $\prec$ on X is a weak order, preferences can be
faithfully represented by a real-valued function if and only if there is a
countable subset Y of X such that whenever $x \prec y$ there is a $z \in Y$ such that
($x \prec z$ or $x \sim z$) and ($z \sim y$ or $z \prec y$). Lexicographic preference orders give
examples where this denseness condition fails. With $\prec$ assumed only to be a
strict partial order, we have given a sufficient but not necessary countable
order denseness condition for real-valued utilities.

When X is a rectangular subset of n-dimensional Euclidean space and
preference increases (or does not decrease) with increases along any dimension,
conditions that make better intuitive sense than plain order denseness
lead to real-valued utilities.

If (3.1) holds for u on X then there is a continuous (in a specified topology
$\mathcal{T}$) utility function on X if and only if $x \prec y$ implies that there are two
subsets of X in $\mathcal{T}$ one of which contains x and has every element less preferred
than y and the other of which contains y and has every element preferred to x.
The conditions on $\prec$ used in the weak order and strict partial order theorems
for utilities on regions of Re^n also imply the existence of continuous (weak
order) and upper semicontinuous (strict partial order) utility functions.


INDEX  TO  EXERCISES 

1. Uncountable sets. 2. Denseness of rationals. 3. Lexicographic order. 4. Theorem 3.1.
5. Theorem 3.2. 6-7. Vector operations. 8. Indifference map. 9-10. Lemma 3.1.
11. Asymmetric transitive closure. 12. Discrete topology. 13. Closed intervals not open.
14. Discontinuity. 15. Theorem 3.5. 16-18. Connected topological spaces. 19. Theorem 3.4.
20. Lower semicontinuity. 21-22. Wold's continuity condition. 23. Convex sets.




Exercises 

1. Prove that Re is uncountable by supposing that $\{0.x_1x_2x_3\cdots : x_i \in \{1, 2\}$
for $i = 1, 2, 3, \ldots\} \subseteq$ Re is countable and showing that this supposition is false.
Note also that $\{(x_1, x_2, \ldots) : x_i \in \{0, 1\}$ for all $i\}$ is uncountable.

2. Let a and b be numbers with $a < b$. Show that there is a rational number in
the open interval $(a, b)$. Use the fact (or axiom) that there is a positive integer n
such that $1 < n(b - a)$. Let m be the smallest integer greater than $na$ and show that
$m/n \in (a, b)$.
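The construction in Exercise 2 is easy to carry out by machine. The following sketch follows the hint step by step; the function name rational_between is an assumption, and when a and b are floating-point numbers the result is subject to rounding in the intermediate products.

```python
from fractions import Fraction
import math

def rational_between(a, b):
    # Assumes a < b. Choose a positive integer n with 1 < n*(b - a),
    # then let m be the smallest integer greater than n*a.
    n = math.floor(1 / (b - a)) + 1
    m = math.floor(n * a) + 1
    return Fraction(m, n)

q = rational_between(math.sqrt(2), 1.5)
r = rational_between(Fraction(1, 3), Fraction(1, 2))
```

Since n*a < m <= n*a + 1 < n*b, dividing by n places m/n strictly between a and b.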

3. For the second example following Theorem 3.1 where $X = [-1, 1]$, show that
preferences can be represented by two-dimensional vectors $(u_1(x), u_2(x))$ in Re^2
under a lexicographic order.

4.  Prove  statement  (2)  preceding  (3.2)  in  the  proof  of  Theorem  3.1. 

5.  Describe  in  your  words  the  effect  of  E  in  the  proof  of  Theorem  3.2. 

6. Use (3.6) to evaluate: a. $(1, 1, 2, 3) + (0, -1, -10, 6)$; b. $6(1, 2, 3, 4)$;
c. $3(0, 0, 1, -1) - (-1, 2, -1, 0)$; d. $\alpha(2, 4, -6, -8) + (1 - \alpha)(5, -1, 3, 1)$.

7. The scalar product of real vectors $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$ is
$x \cdot y = x_1y_1 + \cdots + x_ny_n = \sum_{i=1}^{n} x_iy_i$. Evaluate a. $(1, 2, 3, 4, 5) \cdot (6, 7, 8, 9, 10)$;
b. $(3(0, 1, 2) + 4(-2, 1, 3)) \cdot (-5(1, 0, -1))$.
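The vector operations used in Exercises 6 and 7 can be sketched as follows, assuming that (3.6) defines addition and scalar multiplication componentwise. The helper names and the sample vectors are illustrative and deliberately differ from the exercise data.

```python
def add(x, y):
    # Componentwise sum of two vectors of equal length.
    return tuple(xi + yi for xi, yi in zip(x, y))

def scale(a, x):
    # Scalar multiple a*x.
    return tuple(a * xi for xi in x)

def dot(x, y):
    # Scalar product x . y = x_1*y_1 + ... + x_n*y_n.
    return sum(xi * yi for xi, yi in zip(x, y))

x, y = (1, 0, 2), (3, -1, 4)
total = add(x, y)
doubled = scale(2, x)
product = dot(x, y)
mixture = add(scale(0.25, x), scale(0.75, y))   # 0.25*x + (1 - 0.25)*y
```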

8. Use an indifference map in Re^2 to argue that the hypotheses of Theorem 3.3
do not imply the following: if $x \prec y$ and $0 \le \alpha < \beta \le 1$ then
$\beta x + (1 - \beta)y \prec \alpha x + (1 - \alpha)y$.

9.  Show  that  (3.7)  and  (3.8)  cannot  hold  under  the  stated  hypotheses. 

10. Show that Lemma 3.1 remains valid when “$x < y < z$” is replaced by
“$x \prec y \prec z$ and $x < z$.”

11. Prove that the conclusion of Theorem 3.4 remains valid when condition 1
of its hypotheses is replaced by “the transitive closure of $\prec$ on X is asymmetric.”
(See Exercise 2.5.)

12. The discrete topology for any set X is the set of all subsets of X. Is every
real-valued function on X continuous in the discrete topology? Why? What does this
say about continuity when X is finite?

13. Show that any bounded closed interval $[a, b]$ in Re with $a < b$ is not in $\mathcal{U}$.

14. Let $\prec$ on $X = [0, 2]$ be defined by: $x \prec y$ if ($x < y$ and $x, y \in [0, 1]$) or if
($y < x$ and $x, y \in [1, 2]$); $x \sim (2 - 2x/3)$ when $x \in [0, 1/2)$ and $x \sim (5/3 - 2x/3)$
when $x \in [1/2, 1]$. Show that there is a u on X that satisfies (3.1) and that no such
u can be continuous in the relative usual topology.

15. For the proof of Theorem 3.5 show that $\{x : v(x) \in (b, \infty)\} \in \mathcal{T}$ for every
$b \in$ Re.

16. A topological space $(X, \mathcal{T})$ is connected if X cannot be partitioned into two
nonempty subsets both of which are in $\mathcal{T}$. Prove that if $(X, \mathcal{T})$ is connected, if u




on X is continuous in $\mathcal{T}$, and if $u(x) < u(y)$ for $x, y \in X$, then for each $c \in (u(x), u(y))$
there is a $z \in X$ such that $u(z) = c$.

17. Show that any rectangular subset of Re^n is connected.

18. Let X be a rectangular subset of Re^n. With $x, y \in X$, the line segment $L =
\{\alpha x + (1 - \alpha)y : \alpha \in [0, 1]\}$ between x and y has the relative topology $\mathcal{T}' = \{A \cap
L : A \in \mathcal{U}^n\}$. (a) Given the result of the preceding exercise show that $(L, \mathcal{T}')$ is
connected. (b) Suppose u on X is continuous in $\{A \cap X : A \in \mathcal{U}^n\}$, and let $u'(z) =
u(z)$ when $z \in L$. Argue that $u'$ on L is continuous in $\mathcal{T}'$.

19. In the proof of Theorem 3.4 show that if $x \prec y$, then there is a $T_x \in \mathcal{T} =
\{A \cap X : A \in \mathcal{U}^n\}$ such that $x \in T_x$ and $z \prec y$ for every $z \in T_x$.

20. Let f be a bounded, real-valued function on X and let v on X be defined by
$v(x) = \sup\{\inf\{f(y) : y \in T\} : x \in T, T \in \mathcal{T}\}$. Show that v is lower semicontinuous in
the topology $\mathcal{T}$.

21. Wold (1943). Condition W: if $x \prec y$ and $y \prec z$ then $\alpha x + (1 - \alpha)z \sim y$
for some $\alpha \in (0, 1)$. Show that the conclusions of Theorems 3.3 and 3.6 remain
valid when condition W replaces condition 3 of Theorem 3.3. Also show by
indifference curves in Re^2 that $\alpha$ need not be unique. (See Exercise 8.)

22. Use the results of Exercises 16 and 18 to show that if X is a rectangular subset
in Re^n, if u on X is continuous in $\{A \cap X : A \in \mathcal{U}^n\}$, and if conditions 1 and 2 of
Theorem 3.3 hold, then condition 3 and condition W (Exercise 21) must hold also.

23. $X \subseteq$ Re^n is convex $\Leftrightarrow$ $\alpha x + (1 - \alpha)y \in X$ whenever $x, y \in X$ and $\alpha \in (0, 1)$.
Show that “X is convex” and “$(X, \{A \cap X : A \in \mathcal{U}^n\})$ is not connected” cannot
both be true. Assuming that X is convex, use this result along with that of Exercise
16 to conclude that if there is a real-valued function u on X that satisfies (3.1) and is
continuous in $\{A \cap X : A \in \mathcal{U}^n\}$ then condition 3 of Theorem 3.3 and condition W
must be true. Thus, regardless of whether condition 2 of Theorem 3.3 holds,
condition 3 must hold when X is a convex subset of Re^n in order that there be a
u on X that satisfies (3.1) and is continuous. But note also from Exercise 14 that
there can be a u on X satisfying (3.1) when condition 3 fails and X is convex.


ADDITIVE UTILITIES WITH FINITE SETS


Except for Chapter 6, the remaining chapters of Part I examine special kinds
of preferences and utilities that might arise in multiple-factor situations.
Chapter 3 has already considered some basic theory for n-dimensional
Euclidean spaces. This chapter and the next deal with additive utility
representations for preference orders on sets of n-tuples. Section 4.3 considers
lexicographic utility.

Throughout this chapter we shall usually assume that X is a nonempty
subset of the Cartesian product

    $\prod_{i=1}^{n} X_i = X_1 \times X_2 \times \cdots \times X_n$

of n other finite sets. Thus, each alternative in X is an n-tuple $x = (x_1, \ldots, x_n)$.
Each $X_i$ is a factor or attribute set. For convenience we assume that each
$x_i \in X_i$ is the ith component of some $x \in X$.

The subscript i could refer to n different attributes or performance
characteristics of competing alternatives, it could refer to a time factor (n periods),
and so forth. We shall identify conditions for $\prec$ on X that lead to additive
utility representations such as the one for weak orders:
$x \prec y \Leftrightarrow u_1(x_1) + \cdots + u_n(x_n) < u_1(y_1) + \cdots + u_n(y_n)$.

It should be emphasized that $\prec$ is applied to pairs of complete n-tuples, or
whole alternatives. In multiple-factor situations it often seems natural to
think in terms of a preference order for each factor and then to wonder how
these ought to be combined or synthesized into an overall preference order.
However, this approach presupposes a certain kind of independence among
the factors, namely that the order for a given factor is independent of the
particular levels of the other factors. This can of course be false. For example,
suppose that (chicken for dinner tonight, chicken for dinner tomorrow
night) $\prec$ (steak tonight, steak tomorrow night) $\prec$ (chicken tonight, steak



tomorrow night) $\prec$ (steak tonight, chicken tomorrow night). In this case,
preference for tonight clearly depends on what is assumed about tomorrow
night. Under the hypothesis of chicken tomorrow, steak is preferred tonight.
Under the hypothesis of steak tomorrow, chicken is preferred tonight.
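The dependence in the dinner example can be checked directly. In the sketch below the four alternatives are ranked as in the text, with a larger rank meaning more preferred; the numeric ranks and the function name preferred_tonight are assumptions made for illustration.

```python
# Ranks encode the order from the text: a larger rank is more preferred.
# Each key is (meal tonight, meal tomorrow night).
rank = {
    ("chicken", "chicken"): 0,
    ("steak", "steak"): 1,
    ("chicken", "steak"): 2,
    ("steak", "chicken"): 3,
}

def preferred_tonight(tomorrow):
    # The meal for tonight that is preferred when tomorrow's meal is held fixed.
    return max(["chicken", "steak"], key=lambda tonight: rank[(tonight, tomorrow)])

choice_if_chicken_tomorrow = preferred_tonight("chicken")
choice_if_steak_tomorrow = preferred_tonight("steak")
```

The two choices differ, so no single order on tonight's meal is consistent with the order on whole alternatives.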

For  situations  where  the  independence  conditions  seem  reasonable  and 
additive  utilities  apply,  Fishburn  (1967)  summarizes  a  number  of  ways  to 
estimate factor utilities so as to satisfy the additive representation.

4.1  PREFERENCE  INDEPENDENCE  AMONG  FACTORS 

Consider a two-dimensional case where $X = X_1 \times X_2$, $\prec$ is a weak order
and, for each $x_1, y_1 \in X_1$ and $x_2, y_2 \in X_2$,

    $(x_1, x_2) \prec (y_1, x_2) \Rightarrow (x_1, y_2) \prec (y_1, y_2)$,    (4.1)

    $(x_1, x_2) \prec (x_1, y_2) \Rightarrow (y_1, x_2) \prec (y_1, y_2)$.    (4.2)

The first of these says that, if we define $x_1 \prec_1 y_1 \Leftrightarrow (x_1, x_2) \prec (y_1, x_2)$ for
some $x_2 \in X_2$, then $\prec_1$ is a weak order on $X_1$ that is independent of the
particular element used from $X_2$. Similarly, the second says that, when the
first factor is fixed, there will be a weak order $\prec_2$ on $X_2$ derived in the natural
way from $\prec$ that does not depend on the element used from $X_1$. In the
simplest possible way this suggests that $X_1$ and $X_2$ are independent in a
preference sense.

As demonstrated by Scott and Suppes (1958), even in the two-dimensional
case considered above it may be necessary to go beyond (4.1) and (4.2) to
obtain an additive-utility representation of the form $(x_1, x_2) \prec (y_1, y_2) \Leftrightarrow
u_1(x_1) + u_2(x_2) < u_1(y_1) + u_2(y_2)$. Clearly, (4.1) and (4.2) are necessary for
the existence of such a representation, but they are not sufficient. Suppose
for example that $\prec$ on $X = \{1, 2, 3\} \times \{1, 3, 5\}$ is a weak order with

    $(x_1, x_2) \prec (y_1, y_2) \Leftrightarrow x_1x_2 + (x_1)^{x_2} < y_1y_2 + (y_1)^{y_2}$.    (4.3)

Since $u(x_1, x_2) = x_1x_2 + (x_1)^{x_2}$ is strictly increasing in $x_1$ for any fixed $x_2$ and
is strictly increasing in $x_2$ for any fixed $x_1$, (4.1) and (4.2) hold. However,
additive utilities do not exist. To the contrary, suppose that there are real-valued
functions $u_1$ on $X_1 = \{1, 2, 3\}$ and $u_2$ on $X_2 = \{1, 3, 5\}$ such that
$(x_1, x_2) \prec (y_1, y_2) \Leftrightarrow u_1(x_1) + u_2(x_2) < u_1(y_1) + u_2(y_2)$. Then, since $(2, 1) \sim
(1, 3)$ and $(1, 5) \sim (3, 1)$ by (4.3),

    $u_1(2) + u_2(1) = u_1(1) + u_2(3)$

    $u_1(1) + u_2(5) = u_1(3) + u_2(1)$.

By adding these equalities and cancelling identical terms we get

    $u_1(2) + u_2(5) = u_1(3) + u_2(3)$


Additive Utilities with Finite Sets


which, according to the presumed existence of an additive representation,
yields $(2, 5) \sim (3, 3)$. But, by (4.3), $u(2, 5) = 42$ and $u(3, 3) = 36$, so that
$(3, 3) \prec (2, 5)$. Hence there is no additive representation for this case.
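The arithmetic of this counterexample can be verified by brute force. The sketch below evaluates the function from (4.3) on all nine alternatives, checks (4.1) and (4.2) exhaustively, and recomputes the values used in the contradiction. The names prec, cond_41, and cond_42 are illustrative.

```python
from itertools import product

X1, X2 = (1, 2, 3), (1, 3, 5)

def u(x1, x2):
    # The function from (4.3): x1*x2 + x1**x2.
    return x1 * x2 + x1 ** x2

def prec(x, y):
    # x is less preferred than y under (4.3).
    return u(*x) < u(*y)

# (4.1): (x1, x2) prec (y1, x2) implies (x1, y2) prec (y1, y2).
cond_41 = all(
    (not prec((x1, x2), (y1, x2))) or prec((x1, y2), (y1, y2))
    for x1, y1 in product(X1, X1) for x2, y2 in product(X2, X2)
)
# (4.2): (x1, x2) prec (x1, y2) implies (y1, x2) prec (y1, y2).
cond_42 = all(
    (not prec((x1, x2), (x1, y2))) or prec((y1, x2), (y1, y2))
    for x1, y1 in product(X1, X1) for x2, y2 in product(X2, X2)
)

# The two indifferences force u1(2) + u2(5) = u1(3) + u2(3) in any additive
# representation, yet (4.3) ranks (3, 3) strictly below (2, 5).
indifferent_pairs = u(2, 1) == u(1, 3) and u(1, 5) == u(3, 1)
strict_gap = (u(3, 3), u(2, 5))
```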

Additive  Utilities 

In generalizing independence conditions like (4.1) and (4.2) we shall use a
sequence of equivalence relations $E_m$ on $X^m$ ($m = 2, 3, \ldots$) where $X^m$ is the
m-fold Cartesian product of X with itself.

Definition 4.1. $(x^1, \ldots, x^m) E_m (y^1, \ldots, y^m)$ if and only if $m > 1$, $x^j,
y^j \in X$ for $j = 1, \ldots, m$, and with $X \subseteq \prod_{i=1}^{n} X_i$ it is true for each i that
$x_i^1, \ldots, x_i^m$ is a permutation (reordering) of $y_i^1, \ldots, y_i^m$.

Thus, for (4.1), $((x_1, x_2), (y_1, y_2)) E_2 ((y_1, x_2), (x_1, y_2))$, and in the example
refuting additivity for (4.3), $((2, 1), (1, 5), (3, 3)) E_3 ((1, 3), (3, 1), (2, 5))$.
With $n = 3$ and $(x_1, x_2, x_3)$ = (net profit, market share, dividend per share
of stock), the following arrays reveal that $(x^1, \ldots, x^4) E_4 (y^1, \ldots, y^4)$.


         profit   share   dividend            profit   share   dividend
  x^1    $1m      20%     30¢          y^1    $2m      20%     50¢
  x^2    $0m      10%     50¢          y^2    -$1m     10%     45¢
  x^3    $2m      30%     45¢          y^3    $1m      15%     10¢
  x^4    -$1m     15%     10¢          y^4    $0m      30%     30¢
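Definition 4.1 amounts to comparing, factor by factor, the multiset of values on the x side with the multiset on the y side. A minimal sketch, with the function name related_by_E and the numeric encoding of the table (profit in millions of dollars, share in percent, dividend in cents) introduced here as assumptions:

```python
from collections import Counter

def related_by_E(xs, ys):
    # (x^1, ..., x^m) E_m (y^1, ..., y^m): the same number m > 1 of alternatives
    # and, for each factor i, the i-th components on the x side are a
    # reordering of the i-th components on the y side.
    if len(xs) != len(ys) or len(xs) < 2:
        return False
    return all(Counter(col_x) == Counter(col_y)
               for col_x, col_y in zip(zip(*xs), zip(*ys)))

# The arrays from the text as (profit, share, dividend).
xs = [(1, 20, 30), (0, 10, 50), (2, 30, 45), (-1, 15, 10)]
ys = [(2, 20, 50), (-1, 10, 45), (1, 15, 10), (0, 30, 30)]

table_related = related_by_E(xs, ys)
example_related = related_by_E([(2, 1), (1, 5), (3, 3)], [(1, 3), (3, 1), (2, 5)])
```

Both calls return True, in agreement with the E_4 and E_3 relations stated in the text.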

For  the  purpose  of  further  discussion  we  shall  first  present  three  additive 
utility  theorems.  For  comparative  convenience  they  are  presented  together 
in Theorem 4.1. There is a theorem A, a theorem B, and a theorem C, with
“hypotheses” and “conclusions” noted accordingly.

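THEOREM 4.1. Suppose $X \subseteq \prod_{i=1}^n X_i$ is finite. Then

A. $[(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots, y^m),\ x^j \prec y^j$ or $x^j = y^j$ for $j = 1, \ldots, m - 1] \Rightarrow$ not $x^m \prec y^m$;

B. $[(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots, y^m),\ x^j \prec y^j$ or $x^j \approx y^j$ for $j = 1, \ldots, m - 1] \Rightarrow$ not $x^m \prec y^m$;

C. $[(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots, y^m),\ x^j \prec y^j$ or $x^j \sim y^j$ for $j = 1, \ldots, m - 1] \Rightarrow$ not $x^m \prec y^m$;

for all $x^1, \ldots, x^m, y^1, \ldots, y^m \in X$ and $m = 2, 3, \ldots$, if and only if there are
real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ respectively such that, for all
$x, y \in X$,

A*. $x \prec y \Rightarrow \sum_{i=1}^n u_i(x_i) < \sum_{i=1}^n u_i(y_i)$;

B*. $x \prec y \Rightarrow \sum_{i=1}^n u_i(x_i) < \sum_{i=1}^n u_i(y_i)$, $\ x \approx y \Rightarrow \sum_{i=1}^n u_i(x_i) = \sum_{i=1}^n u_i(y_i)$;

C*. $x \prec y \Leftrightarrow \sum_{i=1}^n u_i(x_i) < \sum_{i=1}^n u_i(y_i)$.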


Indifference ($\sim$) and $\approx$ are defined as $x \sim y \Leftrightarrow$ (not $x \prec y$, not $y \prec x$) and
$x \approx y \Leftrightarrow$ ($x \sim z \Leftrightarrow y \sim z$, for all $z \in X$), as in Chapters 2 and 3.

Unlike (4.1) and (4.2), the conclusions of A, B, and C are stated in the
negative. It is easily seen that A is necessary for A*, that B is necessary for
B*, and that C is necessary for C*. For example, suppose that A* holds and
that the hypotheses of A hold with $(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots, y^m)$ and $x^j \prec y^j$
or $x^j = y^j$ for each $j < m$. Then, by A*,
$\sum_{j=1}^{m-1} \sum_{i=1}^n u_i(x_i^j) \le \sum_{j=1}^{m-1} \sum_{i=1}^n u_i(y_i^j)$.

But, by $E_m$, $\sum_{j=1}^{m} \sum_{i=1}^n u_i(x_i^j) = \sum_{j=1}^{m} \sum_{i=1}^n u_i(y_i^j)$. Therefore
$\sum_{i=1}^n u_i(y_i^m) \le \sum_{i=1}^n u_i(x_i^m)$, which by A* implies not $x^m \prec y^m$, which is the
conclusion of A.

We  shall  consider  the  sufficiency  of  A  for  A* ,  B  for  B*,  and  C  for  C*  in  the 
next  section.  These  sufficiency  proofs  will  be  based  on  a  theorem  from  linear 
algebra  called  the  Theorem  of  The  Alternative,  which  will  be  proved  in  the 
next  section. 

Further Remarks on Independence Conditions

Each of conditions A, B, and C in Theorem 4.1 is actually a denumerable
bundle of conditions, one for each equivalence $E_m$, $m = 2, 3, \ldots$. If we let
$A_m$, $B_m$, and $C_m$ denote the part of condition A, B, and C that applies to
$E_m$, then $A_{m+1} \Rightarrow A_m$, $B_{m+1} \Rightarrow B_m$, and $C_{m+1} \Rightarrow C_m$ for all $m \ge 2$. However,
as suggested by Scott and Suppes (1958), there is no one finite value of $m$ for
which $A_m \Rightarrow$ A* or $B_m \Rightarrow$ B* or $C_m \Rightarrow$ C* for all finite sets $X$. We now
consider some of the other aspects of A, B, and C.

Our main purpose in including $x^j = y^j$ in the hypotheses of A was to get
$A_{m+1} \Rightarrow A_m$, but the equality part of the hypothesis of A is unnecessary.
Although A does not imply that $\prec$ is a strict partial order since it does not
imply transitivity, it does say that if $x^1 \prec x^2, x^2 \prec x^3, \ldots, x^{m-1} \prec x^m$ then
not $x^m \prec x^1$. This follows from the fact that $(x^1, x^2, \ldots, x^m) \, E_m \, (x^2, \ldots,
x^m, x^1)$. Hence when A holds, the transitive closure of $\prec$ (Exercise 2.5) is a
strict partial order.

Like A, B does not imply that $\prec$ is a strict partial order. For example,
suppose $X = \{x, y, z, t\}$ and $\prec \ = \{x \prec y, y \prec z, x \prec t\}$ with $\sim$ elsewhere.
Then $\approx$ holds for no distinct pair of elements in $X$ so that B reduces, in
effect, to A. Since A is consistent with $\prec$ as given and $\prec$ is not transitive, B
does not imply that $\prec$ is a strict partial order.

On the other hand, B does imply that $\approx$ is an equivalence since it implies
asymmetry on considering $(x, y) \, E_2 \, (y, x)$, and asymmetry of $\prec$ implies that
$\approx$ is an equivalence (Exercise 2.3). B implies also, as in the conclusions of
Theorem 2.3, that ($x \prec y$, $y \approx z$) $\Rightarrow x \prec z$ and that ($x \approx y$, $y \prec z$) $\Rightarrow x \prec z$.
For example, since $(x, y, z) \, E_3 \, (y, z, x)$, $x \prec y$ and $y \approx z$ imply not $z \prec x$ by
B. Hence either $x \prec z$ or $x \sim z$. If $x \sim z$ then $x \sim y$ by the definition of $\approx$, for
$y \approx z$. But $x \prec y$. Hence $x \prec z$.




C of course implies that $\prec$ is a weak order. Suppose not $x \prec y$ and not
$y \prec z$. Then $y \prec x$ or $y \sim x$, and $z \prec y$ or $z \sim y$, so that, since $(y, z, x) \, E_3 \,
(x, y, z)$, C implies not $x \prec z$. Hence, C implies that $\prec$ is negatively transitive.
Asymmetry follows from $(x, y) \, E_2 \, (y, x)$.

Remarks  on  Additive  Utilities 

It should be noted that if additive utilities exist in the sense of B*, or
C*, then it does not follow that any utility function $u$ on $X$ that preserves the
preference order can be written in an additive form. For example suppose in
connection with C* that $x \prec y \Leftrightarrow u(x) < u(y)$. Then it may be impossible to
write $u$ in an additive form when C* holds. What C* says is that, among all
functions $u$ that satisfy $x \prec y \Leftrightarrow u(x) < u(y)$, there is at least one that can be
written in the additive form as $u(x) = u_1(x_1) + \cdots + u_n(x_n)$.

It cannot be emphasized too strongly that additive utilities might not exist
in some situations where their use seems attractive for ease in analysis.
Possibly the best way to test condition A, or B, or C, is to try deliberately to
find $m$-tuples in $X$ that violate the condition. An inability to construct a
violation would lend support to the credibility of the condition. Another
obvious way of testing for additivity is to obtain a set of preference statements,
convert these into additive utility inequalities and equalities (for C
when $\sim$ arises) and test this system for the existence of a solution. If no
solution exists then a violation of the appropriate condition has been
uncovered.
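The first testing strategy, a deliberate search for violating $m$-tuples, can be sketched as a brute-force enumeration when $X$ is small. The code below is illustrative only. It assumes that the preference order is given by a utility function, and it uses $u(x_1, x_2) = x_1 x_2 + x_1^{x_2}$ as a hypothetical stand-in for (4.3); this form reproduces the values $u(2, 5) = 42$ and $u(3, 3) = 36$ quoted earlier but is not taken from the text. The function name `find_C_violation` is likewise an assumption.

```python
from itertools import permutations, product

def u(x):
    # Hypothetical stand-in for (4.3): u(x1, x2) = x1*x2 + x1**x2.
    return x[0] * x[1] + x[0] ** x[1]

def find_C_violation(X, u, m):
    """Search for x-tuples and y-tuples with (x^1..x^m) E_m (y^1..y^m),
    x^j preferred-or-indifferent below y^j for j < m, and x^m strictly below y^m.
    Returns (xs, ys) for the first violation of C_m found, or None."""
    X_set = set(X)
    for xs in product(X, repeat=m):
        columns = list(zip(*xs))  # levels used on each factor
        # Every y-tuple related to xs by E_m arises by reordering each column.
        for new_columns in product(*(set(permutations(c)) for c in columns)):
            ys = list(zip(*new_columns))
            if any(y not in X_set for y in ys):
                continue
            hypotheses = all(u(xs[j]) <= u(ys[j]) for j in range(m - 1))
            if hypotheses and u(xs[-1]) < u(ys[-1]):
                return list(xs), ys
    return None

X = list(product((1, 2, 3), (1, 3, 5)))
print(find_C_violation(X, u, 2))  # None: no violation with m = 2
print(find_C_violation(X, u, 3))  # some violating pair of 3-tuples
```

The search is exponential in $m$, so it is practical only for small sets; for larger problems the second strategy in the paragraph above (testing a system of linear inequalities for a solution) is the more natural route.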

4.2  THEOREM  OF  THE  ALTERNATIVE 

To prove the sufficiency of the conditions A, B, and C of Theorem 4.1,
we shall use the following theorem, which is discussed by Tucker (1956, p. 10),
Goldman (1956), and Aumann (1964, p. 225), and which has been used by
Tversky (1964), Scott (1964), and Adams (1965) to prove theorems like
Theorem 4.1. $\mathrm{Re}^N$ is $N$-dimensional Euclidean space and $c \cdot x^k = \sum_{j=1}^N c_j x_j^k$.

THEOREM 4.2 (THEOREM OF THE ALTERNATIVE). If $x^1, \ldots,
x^M \in \mathrm{Re}^N$ and $1 \le K \le M$, then either there is a $c \in \mathrm{Re}^N$ such that

$$c \cdot x^k > 0 \quad \text{for } k = 1, \ldots, K \tag{4.4}$$

$$c \cdot x^k = 0 \quad \text{for } k = K + 1, \ldots, M, \tag{4.5}$$

or there are non-negative numbers $r_1, \ldots, r_K$ not all of which equal zero and
numbers $r_{K+1}, \ldots, r_M$ such that

$$\sum_{k=1}^{M} r_k x_j^k = 0 \quad \text{for } j = 1, \ldots, N. \tag{4.6}$$


Proof. Let $S = \{x^1, \ldots, x^K\}$ and $T = \{x^{K+1}, \ldots, x^M\}$. Let $\bar{S}$ be the
convex closure of $S$ so that

$$\bar{S} = \Big\{x : x = \sum_{i=1}^{m} \lambda_i a^i \text{ with } \sum_{i=1}^{m} \lambda_i = 1,\ m > 0, \text{ and } \lambda_i \ge 0 \text{ and } a^i \in S \text{ for all } i \Big\},$$

and let $T'$ be the vector space generated by $T$ so that

$$T' = \Big\{y : y = \sum_{i=1}^{m} \sigma_i b^i \text{ with } m > 0 \text{ and } \sigma_i \in \mathrm{Re} \text{ and } b^i \in T \text{ for all } i \Big\} \cup \{0\}$$

where $0$ is the origin of $\mathrm{Re}^N$. When $K = M$, $T = \emptyset$ and $T' = \{0\}$.

The two alternatives depend on whether $\bar{S}$ and $T'$ have a common element.
If $\bar{S} \cap T' \ne \emptyset$ then (4.6) holds as is seen from $\sum \lambda_i a^i = \sum \sigma_i b^i$, or
$\sum \lambda_i a^i - \sum \sigma_i b^i = 0$, with the obvious definitions of the $r_k$ in terms of the
$\lambda_i$ ($k \le K$) and $\sigma_i$ ($k > K$).

On the other hand, (4.4)-(4.5) hold when $\bar{S} \cap T' = \emptyset$. Since both $S$ and
$T$ are finite sets (and this is critical to the conclusion), it can be shown that
there are vectors $s \in \bar{S}$ and $t \in T'$ such that $(x - y)^2 \ge (s - t)^2 > 0$ when
$x \in \bar{S}$ and $y \in T'$. Here $x^2 = x \cdot x$ (not to be confused with $x^2 \in S$). Let $x \in \bar{S}$.
Then, with $0 \le \lambda \le 1$, $(1 - \lambda)s + \lambda x \in \bar{S}$. Since $t \in T'$, $(1 - \lambda)t \in T'$. Hence
$[(1 - \lambda)s + \lambda x - (1 - \lambda)t]^2 \ge (s - t)^2$, which reduces to
$2\lambda (s - t) \cdot (x - (s - t)) + \lambda^2 ((s - t) - x)^2 \ge 0$. Take $\lambda > 0$, divide by $\lambda$, and let
$\lambda$ approach 0: this leaves $(s - t) \cdot (x - (s - t)) \ge 0$, or $(s - t) \cdot x \ge
(s - t)^2 > 0$, or $(s - t) \cdot x > 0$. Let $c = s - t$. Thus $c \cdot x > 0$ for all $x \in \bar{S}$,
so that (4.4) holds.

To verify (4.5) when $K < M$ and $\bar{S} \cap T' = \emptyset$, take $y \in T'$. Then
$\sigma y + t \in T'$ so that $(\sigma y + t - s)^2 \ge (s - t)^2$, or $\sigma^2 y^2 \ge 2\sigma y \cdot (s - t) = 2\sigma c \cdot y$.
First, take $\sigma > 0$, divide by $\sigma$ and let $\sigma$ approach 0. Since $y^2 \ge 0$ this leaves
$0 \ge c \cdot y$. Second, take $\sigma < 0$ and divide by $\sigma$ giving $\sigma y^2 \le 2c \cdot y$. Letting
$\sigma$ approach 0 from below gives $0 \le c \cdot y$. Hence $c \cdot y = 0$. ♦


In the following I shall detail only the proof that B $\Rightarrow$ B* in Theorem 4.1.
The proof that C $\Rightarrow$ C* is entirely similar since C* is equivalent to $x \prec y \Rightarrow
\sum u_i(x_i) < \sum u_i(y_i)$ and $x \sim y \Rightarrow \sum u_i(x_i) = \sum u_i(y_i)$, which is like B* with
$\approx$ replaced by $\sim$. The proof that A $\Rightarrow$ A*, as given by Adams (1965),
involves only (4.4) from Theorem 4.2 and not (4.5) since there are no
equality implications in A*.


Sufficiency Proof of Theorem 4.1B. Let B hold. For the application of
Theorem 4.2 we let $N$ equal the size of $X_1$ plus the size of $X_2$ $\cdots$ plus the size
of $X_n$ and let $c = (u_1(x_{11}), \ldots, u_n(x_{nt}))$ with $N$ components. Let $K$
be the size of $\prec$ (the number of $x \prec y$ statements) and let $M - K$ be half
the size of $\approx$ with the identity pairs removed, containing exactly one of $x \approx y$
and $y \approx x$ for each such $x$, $y$ pair for which $x \approx y$. The $K$ $\prec$ statements in the
conclusion B* and the $M - K$ $\approx$ statements translate into the equivalent system

$$c \cdot a^k > 0 \quad \text{for } k = 1, \ldots, K \quad (x^k \prec y^k) \tag{4.7}$$

$$c \cdot a^k = 0 \quad \text{for } k = K + 1, \ldots, M \quad (x^k \approx y^k) \tag{4.8}$$

where each $a_j^k \in \{-1, 0, 1\}$ and $\sum_j a_j^k = 0$ for each $k$. B* holds if and only
if (4.7) and (4.8) have a $c$ solution.

Suppose there is no such $c$ solution. Then, by Theorem 4.2, there are
$r_k \ge 0$ for $k = 1, \ldots, K$ with $r_k > 0$ for some $k \le K$, and $r_{K+1}, \ldots, r_M$
such that

$$\sum_{k=1}^{M} r_k a_j^k = 0 \quad \text{for } j = 1, \ldots, N. \tag{4.9}$$

Because each $a_j^k$ is rational there is a set of rational and hence integer $r_k$ that
satisfy (4.9). If some of these integer $r_k$ for $k > K$ are negative they can be
made positive by replacing $a^k$ with $-a^k$ in (4.8) and (4.9) and replacing $r_k$
by $-r_k$ in (4.9), which does not essentially change (4.8) or (4.9) and is
legitimate from the standpoint of (4.8) since $\approx$ is symmetric. Then, with
all $r_k \ge 0$, (4.9) says that ($r_1$ $x^1$'s, $r_2$ $x^2$'s, $\ldots$, $r_M$ $x^M$'s) $E_{r_1 + r_2 + \cdots + r_M}$
($r_1$ $y^1$'s, $\ldots$, $r_M$ $y^M$'s) with $x^k \prec y^k$ for $k = 1, \ldots, K$ and $x^k \approx y^k$ for
$k = K + 1, \ldots, M$. Since some $r_k > 0$ for $k \le K$ it follows that B does not hold,
for if $\sum r_k = 1$ then irreflexivity of $\prec$ (implied by B) is violated and if
$\sum r_k > 1$ then B is violated as it stands. Since B is in fact assumed to hold it
must be false that there is no $c$ solution for (4.7) and (4.8). ♦
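As a small numerical illustration of the second alternative, consider the earlier example refuting additivity for (4.3), read in the sense of C (with $\sim$ in place of $\approx$, as the text says the C proof proceeds). The sketch below builds the coefficient vectors $a^k$ for the one strict statement and the two indifference statements and exhibits multipliers $r_k$ satisfying (4.9). The helper name `coefficient_vector` and the ordering of the components of $c$ are assumptions for illustration.

```python
# Levels that actually occur on each factor in the example.
levels = [(1, 2, 3), (1, 3, 5)]

# Position of u_i(level) inside the vector c of unknown utilities.
index = {}
for i, factor_levels in enumerate(levels):
    for v in factor_levels:
        index[(i, v)] = len(index)

def coefficient_vector(x, y):
    """Vector a with c . a = sum_i u_i(y_i) - sum_i u_i(x_i)."""
    a = [0] * len(index)
    for i, (xi, yi) in enumerate(zip(x, y)):
        a[index[(i, yi)]] += 1
        a[index[(i, xi)]] -= 1
    return a

strict = [((3, 3), (2, 5))]                    # (3,3) strictly below (2,5): c . a > 0
equal = [((2, 1), (1, 3)), ((1, 5), (3, 1))]   # indifferent pairs: c . a = 0
vectors = [coefficient_vector(x, y) for x, y in strict + equal]

r = [1, 1, 1]  # r_1 > 0 on the strict statement, as (4.9) requires
combination = [
    sum(rk * a[j] for rk, a in zip(r, vectors)) for j in range(len(index))
]
print(combination)  # [0, 0, 0, 0, 0, 0]
```

Because a non-negative combination with a positive weight on a strict statement vanishes, no $c$ can satisfy the system, which matches the conclusion that no additive representation exists for this example.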

4.3  LEXICOGRAPHIC  UTILITIES 

The purpose of this section is to note an affinity between additive utilities
and lexicographic utilities. For the latter case we define $<_L$ for real vectors
$a = (a_1, \ldots, a_n)$ and $b = (b_1, \ldots, b_n)$:

$$a <_L b \Leftrightarrow a \ne b \text{ and } b_k < a_k \Rightarrow a_j < b_j \text{ for some } j < k,\ k = 1, \ldots, n. \tag{4.10}$$

Thus $a <_L b \Leftrightarrow a_1 < b_1$ or $[a_1 = b_1, a_2 < b_2]$ or $\cdots$ or $[a_1 = b_1, \ldots,
a_{n-1} = b_{n-1}, a_n < b_n]$.
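The expanded form of $<_L$ translates directly into a scan for the first coordinate at which the two vectors differ. The sketch below is illustrative; the function name `lex_less` is an assumption. Python's built-in comparison of equal-length tuples follows the same lexicographic rule, which gives a convenient cross-check.

```python
def lex_less(a, b):
    """a <_L b: at the first coordinate where a and b differ, a is smaller."""
    for ak, bk in zip(a, b):
        if ak < bk:
            return True
        if ak > bk:
            return False
    return False  # equal vectors are not related by <_L

print(lex_less((1, 5), (2, 0)))        # True: decided by the first coordinate
print(lex_less((1, 2, 3), (1, 2, 4)))  # True: decided by the last coordinate
print(lex_less((1, 2), (1, 2)))        # False: a = b
```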

In comparison with Theorem 4.1A we shall consider the existence of real-valued
functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ such that

$$x \prec y \Rightarrow (u_1(x_1), \ldots, u_n(x_n)) <_L (u_1(y_1), \ldots, u_n(y_n)). \tag{4.11}$$

The comparison for Theorem 4.1C is

$$x \prec y \Leftrightarrow (u_1(x_1), \ldots, u_n(x_n)) <_L (u_1(y_1), \ldots, u_n(y_n)). \tag{4.12}$$




In both cases the order of the $X_i$ is very significant. In a preference sense,
(4.11) or (4.12) says approximately that $X_1$ dominates $X_2$, $X_2$ dominates $X_3$,
and so forth.

The main point to be made about (4.11) and (4.12) is that condition A of
Theorem 4.1 is necessary for (4.11), and condition C is necessary for (4.12).
For example, suppose (4.11) holds along with the hypotheses of condition A:
$(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots, y^m)$ and $x^j \prec y^j$ or $x^j = y^j$ for $j = 1, \ldots, m - 1$.
Then $u_1(x_1^j) \le u_1(y_1^j)$ for all $j < m$, and since $\sum_{j=1}^m u_1(x_1^j) = \sum_{j=1}^m u_1(y_1^j)$ by $E_m$,
$u_1(y_1^m) \le u_1(x_1^m)$. If $u_1(x_1^j) < u_1(y_1^j)$ for some $j < m$ then $u_1(y_1^m) < u_1(x_1^m)$ so
that $(u_1(y_1^m), \ldots, u_n(y_n^m)) <_L (u_1(x_1^m), \ldots, u_n(x_n^m))$. If $u_1(x_1^j) = u_1(y_1^j)$ for all
$j < m$ then $u_1(y_1^m) = u_1(x_1^m)$, in which case we repeat the analysis just given,
using $u_2$ instead of $u_1$. Continuing this we conclude that either $(u_1(y_1^m), \ldots,
u_n(y_n^m)) <_L (u_1(x_1^m), \ldots, u_n(x_n^m))$ or else that the two utility vectors are equal.
It then follows from (4.11) that not $x^m \prec y^m$, which is the conclusion of
condition A.

Thus,  if  X  is  finite  and  lexicographic  utilities  exist  in  the  sense  of  (4.11) 
then  additive  utilities  exist  in  the  sense  of  A*  in  Theorem  4.1.  A  similar 
assertion  holds  for  (4.12)  and  C*.  Clearly,  the  converses  of  these  assertions 
are not generally valid.

What  is  required  for  (4.12)  in  addition  to  condition  C?  Clearly,  something 
like  the  following  is  needed. 

Condition L. If $x \prec y$ when $x_i = y_i$ for all $i$ except $i = k$, then $x^* \prec y^*$
when ($x_i^* = x_i$, $y_i^* = y_i$) for all $i \le k$, provided $x, y, x^*, y^* \in X$.

Interestingly enough, when this lexicographic dominance condition is
used and $X = \prod_{i=1}^n X_i$, it is no longer necessary to use all of condition C. The
following uses only the $m = 2$ part of C. We can also remove the strict
finiteness assumption for $X$.

THEOREM 4.3. Suppose $X$ is countable and $X = \prod_{i=1}^n X_i$. Then (4.12)
holds for all $x, y \in X$ if and only if

1. $\prec$ is negatively transitive,

2. $[(x, z) \, E_2 \, (y, w),\ x \prec y$ or $x \sim y] \Rightarrow$ not $z \prec w$,

3. condition L holds.

Sufficiency Proof. Under the given conditions define $x_i \prec_i y_i \Leftrightarrow z \prec w$
for some $z, w \in X$ such that $z_j = w_j$ for all $j \ne i$ and ($z_i = x_i$, $w_i = y_i$). Then
$\prec_i$ on $X_i$ is a weak order: asymmetry follows from condition 2 and negative
transitivity follows from condition 1 and $X = \prod X_i$. Then, by Theorem 2.2,
there is, for each $i$, a real-valued function $u_i$ on $X_i$ such that $x_i \prec_i y_i \Leftrightarrow
u_i(x_i) < u_i(y_i)$.

Suppose that $(u_1(x_1), \ldots, u_n(x_n)) <_L (u_1(y_1), \ldots, u_n(y_n))$ and let $t$ be the
smallest $i$ for which $u_i(x_i) < u_i(y_i)$, with $u_i(x_i) = u_i(y_i)$ for all $i < t$. We wish
to show that $x \prec y$. To do this we first note that, if $1 < t$, (not $x_1 \prec_1 y_1$, not
$y_1 \prec_1 x_1$) $\Rightarrow (x_1, x_2, \ldots, x_n) \sim (y_1, x_2, \ldots, x_n)$. Similarly, if $2 < t$, $u_2(x_2) =
u_2(y_2) \Rightarrow (y_1, x_2, \ldots, x_n) \sim (y_1, y_2, x_3, \ldots, x_n)$. Continuing this and using
the transitivity of $\sim$ (from weak order), we get $(x_1, \ldots, x_n) \sim (y_1, \ldots,
y_{t-1}, x_t, \ldots, x_n)$. Now $x_t \prec_t y_t$. Therefore $(y_1, \ldots, y_{t-1}, x_t, \ldots, x_n) \prec
(y_1, \ldots, y_{t-1}, y_t, x_{t+1}, \ldots, x_n)$ by the definition of $\prec_t$ and condition 2.
Hence, by condition L, $(y_1, \ldots, y_{t-1}, x_t, \ldots, x_n) \prec (y_1, \ldots, y_{t-1}, y_t,
y_{t+1}, \ldots, y_n)$. Thus, by Theorem 2.1, $x \prec y$.

On the other hand suppose that not $(u_1(x_1), \ldots, u_n(x_n)) <_L (u_1(y_1), \ldots,
u_n(y_n))$. Then either $(u_1(y_1), \ldots, u_n(y_n)) <_L (u_1(x_1), \ldots, u_n(x_n))$, in which
case $y \prec x$ and hence not $x \prec y$, or else the two utility vectors are equal, in
which case the $\sim$ analysis of the preceding paragraph leads to $x \sim y$ and
hence not $x \prec y$. ♦

4.4  SUMMARY 

When $X$ is a finite subset of the Cartesian product of $n$ other sets, additive
utilities for several cases considered exist if and only if appropriate independence
conditions hold. The finiteness of $X$ is crucial for these cases in the
absence of additional conditions. However, in a weak order case with
$X = \prod X_i$, the finiteness condition can be replaced by a countability condition
in a simple axiomatization for lexicographic utility according to a
definite dominance order for the $n$ factors. With $X$ finite, lexicographic
utilities imply the existence of additive utilities, but the converse is not
generally true.

It  cannot  be  emphasized  too  strongly  that  additive  utilities  might  not  exist 
in  some  situations  where  their  use  seems  attractive  for  ease  in  analysis. 


INDEX  TO  EXERCISES 

1.  The  size  problem.  2.  Weak  orders  and  additivity.  3-4.  Functional  forms  which  may 
or  may  not  admit  additive  utilities.  5.  Condition  C4.  6.  The  necessity  of  all  of  C.  7-8. 
Variations  on  C.  9.  Em.  10.  Necessity  of  B*  and  C*.  11-12.  B  and  A.  13-18.  Applications 
of  the  Theorem  of  The  Alternative.  18.  Proof  of  Theorem  2.9.  19.  <L.  20.  Admissible 
transformations. 


Exercises 

1. Suppose $X = \prod_{i=1}^{10} X_i$ and each $X_i$ has 10 elements. Then $X$ has 10 billion
elements but there are only 100 $x_i$. Discuss the potential attractiveness of additive
utilities from the standpoint of size and the number of utility values that need to be
estimated.

2. Let $X = \{a, b\} \times \{c, d\}$. How many preference weak orders can be defined
on $X$? List those for which additive utilities exist as in Theorem 4.1C*.

3. With $X = X_1 \times X_2$, $X_i = \{1, 2, \ldots, M\}$ with $M$ large, and $x \prec y \Leftrightarrow
u(x_1, x_2) < u(y_1, y_2)$, for which of the following cases do there exist $u_1$ and $u_2$ so
that $x \prec y \Leftrightarrow u_1(x_1) + u_2(x_2) < u_1(y_1) + u_2(y_2)$? (a) $u(x_1, x_2) =$ [illegible];
(b) $u(x_1, x_2) = x_1 + x_1 x_2$; (c) $u(x_1, x_2) = x_1 + x_2 + x_1 x_2$; (d) $u(x_1, x_2) = \sup \{x_1, x_2\}$;
(e) $u(x_1, x_2) = |x_1 - x_2|$ (absolute value); (f) $u(x_1, x_2) =$ [illegible];
(g) $u(x_1, x_2) = x_1/(x_1 + x_2)$. For each that admits additive utilities, tell why this is so.

4. Let $X = X_1 \times X_2$, with $X_1$ and $X_2$ sets of positive integers and suppose that
$u(x_1, x_2) = x_1 x_2 + (x_1 x_2)^3$ and $x \prec y \Leftrightarrow u(x_1, x_2) < u(y_1, y_2)$. Show that additive
utilities exist in the sense of Theorem 4.1C*.

5. The accompanying utility matrix gives $u(a, p)$ for $(a, p) \in X = \{a_1, \ldots, a_4\} \times
\{p_1, \ldots, p_4\}$. Assume that $(a, p) \prec (a', p') \Leftrightarrow u(a, p) < u(a', p')$,

          p_1    p_2    p_3    p_4
  a_1      0      4     [?]     9
  a_2      5      9     12     14
  a_3     [?]    11     13     15
  a_4     10     15     16     17

and show that condition $C_4$ (C with $m = 4$) of Theorem 4.1 fails. Does condition
$C_3$ hold?

6. Let $X = \{(1, 1), (2, 2), \ldots, (m, m), (1, 2), (2, 3), \ldots, (m, 1)\}$.
Let $u(j, k) = 0$ for all $(j, k) \in X$ except for $(m, 1)$ where $u(m, 1) > 0$, and take
$x \prec y \Leftrightarrow u(x) < u(y)$. Show that condition $C_m$ of Theorem 4.1 fails but that
condition $C_{m-1}$ holds.

7. Show that condition C of Theorem 4.1 implies that if $(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots,
y^m)$ and $x^j \prec y^j$ for all $j < m$ then $y^m \prec x^m$.

8. (Continuation.) Tversky (1964) uses an axiom he calls the cancellation law,
which in our terms reads: if $(x^1, \ldots, x^m) \, E_m \, (y^1, \ldots, y^m)$, if $x^j \prec y^j$ or $x^j \sim y^j$ for
all $j < m$, and if $x^j \prec y^j$ for some $j < m$, then $y^m \prec x^m$. With $\sim$ defined by (2.2)
as usual, show that Tversky's condition is implied by C and that C is implied by
Tversky's condition plus the assumption that $\prec$ is irreflexive.

9. Prove that $E_m$ on $X^m$ is an equivalence.

10. Show that B* $\Rightarrow$ B and that C* $\Rightarrow$ C.

11. Prove that B implies that $x \prec z$ when $x \approx y$ and $y \prec z$.

12. Does A of Theorem 4.1 imply that $\approx$ is an equivalence? Why?

13. Write out the sufficiency proof for Theorem 4.1A.

14. Write out the sufficiency proof for Theorem 4.1C.



15. With $\prec_1$ defined by (2.12) for interval orders, let condition D be: $[(x^1, \ldots,
x^m) \, E_m \, (y^1, \ldots, y^m),\ x^j \prec_1 y^j$ for $j = 1, \ldots, m - 1] \Rightarrow$ not $x^m \prec_1 y^m$. Suppose
$X \subseteq \prod X_i$ is finite. Prove that $\prec$ on $X$ is an interval order and condition D holds
if and only if there are real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ respectively
and a nonnegative real-valued function $\sigma$ on $X$ such that, for all $x, y \in X$,

$$x \prec y \Leftrightarrow \sum_{i=1}^{n} u_i(x_i) + \sigma(x) < \sum_{i=1}^{n} u_i(y_i).$$

Use the Theorem of The Alternative in your sufficiency proof.

16. Let condition E be: $[(x^1, \ldots, x^{2m}) \, E_{2m} \, (y^1, \ldots, y^{2m}),\ x^j \sim y^j$ for $j \le m$
and $x^j \prec y^j$ for $j = m + 1, \ldots, 2m - 1] \Rightarrow$ not $x^{2m} \prec y^{2m}$. Show that if E holds
and not $x \prec x$ for some $x \in X$, then $\prec$ on $X$ is irreflexive and asymmetric, $\prec$ is
transitive, and $\prec$ satisfies p10 and p11 of Section 2.4 and hence is a semiorder.
(Do not use the Theorem of The Alternative here.) Note the necessity of using
not $x \prec x$ for some $x \in X$, for without this we could have $X = \{x\}$ and $x \prec x$ with
condition E holding.

17. (Continuation.) Suppose $X \subseteq \prod X_i$ is finite. Prove that $\prec$ on $X$ is irreflexive
and condition E holds (for $m = 1, 2, \ldots$) if and only if there are real-valued
functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ respectively such that, for all $x, y \in X$,

$$x \prec y \Leftrightarrow \sum_{i=1}^{n} u_i(x_i) + 1 < \sum_{i=1}^{n} u_i(y_i).$$

18. Scott (1964): Proof of Theorem 2.9. With $X/\!\approx$ finite select one element from
each $\approx$ class and call the resulting set $Y$. Henceforth, work with $Y$. Each $x \prec y$
statement translates into $u(y) - u(x) - 1 > 0$ by (2.20), and each $x \sim y$ statement
translates into $u(x) + 1 - u(y) > 0$ and $u(y) + 1 - u(x) > 0$. [$\ge 0$ might be used
in the latter two, but $> 0$ will work also.] Let $N$ equal the size of $Y$ plus 1, with
$c = (u(x), \ldots, u(t), 1)$ being $N$-dimensional.

a. Use Theorem 4.2 to show that if there is no $c$ solution to the stated inequalities
then there are sequences $x_1, \ldots, x_T, z_1, \ldots, z_T$ and $y_1, \ldots, y_T, w_1, \ldots, w_T$
such that each is a permutation of the other and $x_k \prec y_k$, $z_k \sim w_k$ for $k =
1, \ldots, T$.

b. Show that $T = 1$ is impossible under the semiorder axioms.

c. Consider $T > 1$. Form a cycle through the two sequences by starting
with some $x_k$. $y_k$ is the second element in the cycle. Find $y_k$ in the first sequence.
Then the third element in the cycle is the element in the second
sequence under $y_k$ in the first. Continue this until you reach $x_k$ in the second
sequence. Show that if any such cycle stays wholly in the $x_k$, $y_k$ pairs then
transitivity of $\prec$ is violated.

d. Hence, with $T > 1$, a cycle beginning with $x_k$ must pass through $z_k$, $w_k$ pairs.
Suppose some $y_j = x_k$. Then use p11 of Section 2.4 to show that you can reduce,
by deletion and rearrangement, the two $T$ sequences to $T - 1$ sequences (one
of which is a permutation of the other, with $T - 1$ $\prec$ and $T - 1$ $\sim$ statements
between the two). Suppose no $y_j = x_k$. Use p10 of Section 2.4 to show that
the two $T$ sequences can be reduced to corresponding $T - 1$ sequences.

e. Conclude the proof of Theorem 2.9.

19. Verify that $<_L$ on a set of $n$-dimensional real vectors is a strict order. (See
Definition 2.16.)

20. When Theorem 4.1C* holds with $X$ finite, discuss the nature of transformations
on the $u_i$ under which C* will remain valid. Do the same for lexicographic
utilities when (4.12) holds.


Chapter  5 


ADDITIVE  UTILITIES  WITH 
INFINITE  SETS 


This  chapter  presents  two  well-structured  theories  for  additive  utilities  on 
infinite  sets.  The  earlier  theory,  due  to  Debreu  (1960),  is  presented  in  Section 
5.4.  It  is  based  on  topological  notions  that  are  defined  in  Section  5.3.  The 
other  theory,  due  to  Luce  and  Tukey  (1964)  and  Luce  (1966),  is  given  in 
Section  5.2.  As  Krantz  (1964)  has  noted,  proofs  in  the  latter  theory  can  be 
based  on  the  theory  of  ordered  groups.  Section  5.1  presents  some  of  this 
theory. 

Throughout this chapter, $X$ is a complete Cartesian product, $X = \prod_{i=1}^n X_i$,
and $\prec$ is assumed to be a weak order. Partly as a result of these assumptions
along with a rather “tight” structure for $\prec$ on $X$, we shall not require all of
condition C of Theorem 4.1. When $n = 2$, $C_3$ (condition C with $m = 3$) will
suffice, and when $n \ge 3$, $C_2$ as in Theorem 4.3 will do. The assumptions of
the theories imply the existence of additive utilities that are unique up to
similar positive linear transformations. By this we mean that if real-valued
functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ satisfy $x \prec y \Leftrightarrow \sum_{i=1}^n u_i(x_i) < \sum_{i=1}^n u_i(y_i)$,
for all $x, y \in X$, then real-valued functions $v_1, \ldots, v_n$ on $X_1, \ldots, X_n$ satisfy
$x \prec y \Leftrightarrow \sum_{i=1}^n v_i(x_i) < \sum_{i=1}^n v_i(y_i)$, for all $x, y \in X$, if and only if there are
numbers $a, b_1, \ldots, b_n$ with $a > 0$ such that

$$v_i(x_i) = a u_i(x_i) + b_i \quad \text{for all } x_i \in X_i;\ i = 1, \ldots, n. \tag{5.1}$$

5.1  STRICTLY  ORDERED  GROUPS 

A group is a set $Y$ and a function that maps each $(x, y) \in Y \times Y$ into an
element $x + y$ in $Y$ such that for every $x, y, z \in Y$ and some fixed element
$e \in Y$,

G1. $(x + y) + z = x + (y + z)$ (associativity)

G2. $x + e = e + x = x$ (identity)

G3. there is $-x \in Y$ such that $x + (-x) = -x + x = e$ (additive inverse).

$e$ is the group identity and $-x$ is the inverse of $x$. A group $(Y, +)$ is commutative
if the following holds throughout $Y$:

G4. $x + y = y + x$.

$(\mathrm{Re}, +)$ with $+$ natural addition and $e = 0$ is a commutative group. So is
$(\{0, 1\}, +)$ with $-0 = 0$, $-1 = 1$, $0 + 0 = 1 + 1 = 0$, $0 + 1 = 1 + 0 = 1$
and $e = 0$.
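For a finite set the axioms G1 through G4 can be verified exhaustively. The sketch below does this for the two-element example just given, under the assumption that its addition table is addition modulo 2; the names `add` and `neg` are illustrative.

```python
from itertools import product

Y = (0, 1)
e = 0

def add(x, y):
    # 0 + 0 = 1 + 1 = 0 and 0 + 1 = 1 + 0 = 1, i.e. addition modulo 2.
    return (x + y) % 2

def neg(x):
    # -0 = 0 and -1 = 1: each element is its own inverse.
    return x

g1 = all(add(add(x, y), z) == add(x, add(y, z)) for x, y, z in product(Y, repeat=3))
g2 = all(add(x, e) == x == add(e, x) for x in Y)
g3 = all(add(x, neg(x)) == e == add(neg(x), x) for x in Y)
g4 = all(add(x, y) == add(y, x) for x, y in product(Y, repeat=2))
print(g1, g2, g3, g4)  # True True True True
```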

When $m$ is a positive integer, $mx = x + x + \cdots + x$ ($m$ times). When $m$
is a negative integer, $mx = -x - x - \cdots - x$ ($-m$ times). $0x = e$. If
$(Y, +)$ is a group and $m$ and $n$ are integers, it is not hard to show that
$mx + nx = (m + n)x$.

Definition 5.1. A strictly ordered group $(Y, +, \prec)$ is a group $(Y, +)$ and
a strict order $\prec$ on $Y$ such that, for all $x, y, z \in Y$,

$$x \prec y \Rightarrow x + z \prec y + z \text{ and } z + x \prec z + y. \tag{5.2}$$

A strictly ordered group is Archimedean if and only if for all $x, y \in Y$,
($e \prec x$, $e \prec y$) $\Rightarrow y \prec mx$ for some positive integer $m$.

Let $Y = \{(j, k) : j \text{ and } k \text{ are integers}\}$, let $+$ be natural addition, and let
$\prec \ = \ <_L$, so that $(j, k) \prec (j', k') \Leftrightarrow j < j'$ or ($j = j'$, $k < k'$). Then
$(Y, +, \prec)$ is a strictly ordered group, but it is not Archimedean since
$(0, 0) \prec (1, 0)$ and $(0, 0) \prec (0, 1)$ and $m(0, 1) = (0, m) \prec (1, 0)$ for every
positive integer $m$. However, additive utilities exist for this case (Exercises
1c, 2). On the other hand if $Y = \{(r, s) : r \text{ and } s \text{ are rational numbers}\}$ then
again $(Y, +, <_L)$ is a non-Archimedean strictly ordered group but additive
utilities do not exist for this case (Exercise 16).
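The failure of the Archimedean property for the integer pairs can be seen numerically: however many copies of $(0, 1)$ are added together, the result stays below $(1, 0)$. The sketch below checks this for the first thousand multiples, which is a finite illustration rather than a proof; the helper names are assumptions.

```python
def add(x, y):
    # Natural (componentwise) addition on pairs of integers.
    return (x[0] + y[0], x[1] + y[1])

def multiple(m, x):
    """m x = x + x + ... + x (m times), for a positive integer m."""
    total = x
    for _ in range(m - 1):
        total = add(total, x)
    return total

def prec(x, y):
    # Python compares tuples lexicographically, which matches <_L on pairs.
    return x < y

e = (0, 0)
x, y = (0, 1), (1, 0)
print(prec(e, x), prec(e, y))                              # True True
print(all(prec(multiple(m, x), y) for m in range(1, 1001)))  # True
print(prec(y, multiple(2, y)))                             # True: 2y does exceed y
```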

The following theorem, due to Hölder (1901), is used in the next section.
The proof given is similar to Fuchs' proof (1963, pp. 45-46).

THEOREM 5.1. Suppose that $(Y, +, \prec)$ is a strictly ordered group. Then
$(Y, +, \prec)$ is Archimedean if and only if there is a real-valued function $f$ on
$Y$ such that, for all $x, y \in Y$,

$$x \prec y \Leftrightarrow f(x) < f(y) \tag{5.3}$$

$$f(x + y) = f(x) + f(y). \tag{5.4}$$

Moreover, if (5.3) and (5.4) hold and if a real-valued function $g$ on $Y$ also
preserves order [as in (5.3)] and is additive [as in (5.4)] then there is a real
number $c > 0$ such that

$$g(x) = c f(x) \quad \text{for all } x \in Y, \tag{5.5}$$

and $c$ is unique if $e \prec x$ for some $x \in Y$.




Proof. The fact that (5.3) and (5.4) imply the Archimedean property
follows from $f(e) = 0$, using G2. To show the converse we assume that the
Archimedean property holds and consider two exhaustive cases.

Case 1: set $Y$ has a smallest “positive” element $x$ so that $e \prec x$ and
$e \prec y \prec x$ for no $y \in Y$. By the Archimedean property and $e \prec x \prec 2x \prec \cdots$,
$e \prec y$ implies that $mx \preceq y \prec (m + 1)x$ for some positive integer $m$, where
$z \preceq y \Leftrightarrow z \prec y$ or $z = y$. Therefore $e \preceq y - mx \prec x$ by (5.2), G3, G2, and
$(m + 1)x = mx + x$. But then, by hypothesis for this case, $e = y - mx$,
and thus $y = mx$ by (5.2), G3, and G2. Likewise, if $y \prec e$ then $y = mx$ for
some negative integer $m$. Let $f(y) = m$ when $y = mx$. If ($y = m_1 x$, $z = m_2 x$)
then $y \prec z \Leftrightarrow m_1 < m_2$ so that (5.3) holds, and $f(y + z) = f(m_1 x + m_2 x) =
f((m_1 + m_2)x) = m_1 + m_2$, verifying (5.4).

Case 2: if $e \prec x$ then $e \prec y \prec x$ for some $y \in Y$. For this case we first
establish G4 (commutativity). Suppose $e \prec y \prec x$. Then either $2y \preceq x$ or
$x \prec 2y$. In the latter case $x - y \prec y$ by (5.2) and $2y - y = y$, so that
$(x - y) + (x - y) \prec (x - y) + y$ by (5.2) and hence $2(x - y) \prec x$ by G1,
G3, and G2. Moreover, $e \prec x - y$ by (5.2) and G3, and $y - x \prec x$ since
$y - x \prec e$ and $e \prec x$. It follows that if $e \prec x$ then there is a $z \in Y$ such that
$e \prec z \prec x$ and $2z \preceq x$. Now suppose that $Y$ is not commutative: for definiteness
assume that $e \prec a$, $e \prec b$, and $a + b \ne b + a$ with $b + a \prec a + b$.
Then let $x = (a + b) - (b + a)$ so that $e \prec x$ by (5.2) and G3, and let $z$ be
such that $e \prec z \prec x$ and $2z \preceq x$ as just established. By the Archimedean
property ($mz \preceq a \prec (m + 1)z$, $nz \preceq b \prec (n + 1)z$) for non-negative integers
$m$ and $n$. Hence $a + b \prec (m + 1)z + b \prec (m + 1)z + (n + 1)z = (m +
n + 2)z$ and $(n + m)z = nz + mz \preceq b + mz \preceq b + a$, or $-(b + a) \preceq
-(n + m)z$, so that $x = (a + b) - (b + a) \prec (m + n + 2)z - (n + m)z =
2z$, or $x \prec 2z$ thus contradicting $2z \preceq x$. Hence G4 holds.

For Case 2 $f$ is defined as follows, assuming $e < x$ for some $x \in Y$ to avoid
the trivial situation. Fix $a$ with $e < a$ and set $f(a) = 1$. For $x \in Y$ let

$L_x = \{m/n : ma \le nx,\ m$ and $n$ integers with $n > 0\}$

$U_x = \{m/n : nx < ma,\ m$ and $n$ integers with $n > 0\}$.

$\{L_x, U_x\}$ is a partition of the rational numbers with $m/n < r/s$ whenever
$m/n \in L_x$ and $r/s \in U_x$, as is easily seen. (For example, if $e < x$ then $ma \le
nx \Rightarrow sma \le snx$ and $sx < ra \Rightarrow nsx < nra$ so that $sma < nra$, or $sm < nr$
(since $nr > 0$), or $m/n < r/s$.) It follows that there is a unique real number
$f(x)$ such that

$f(x) = \sup L_x = \inf U_x$.
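The cut construction can be tried numerically. The following is a minimal sketch, assuming the group is the rationals under addition (an illustrative choice, not part of the theorem); `multiple` and `lower_bound` are hypothetical helper names. It uses only group addition and order comparisons to find the best element of $L_x$ with a given denominator $n$, which approaches $\sup L_x$ as $n$ grows.

```python
from fractions import Fraction

def multiple(k, g):
    """k*g in the additive group, built by repeated addition (k may be negative)."""
    total = Fraction(0)
    for _ in range(abs(k)):
        total += g
    return total if k >= 0 else -total

def lower_bound(x, a, n):
    """Largest m/n with m*a <= n*x, i.e. the best element of L_x with denominator n."""
    nx = multiple(n, x)
    m = 0
    while multiple(m + 1, a) <= nx:   # climb while (m+1)a <= nx
        m += 1
    while multiple(m, a) > nx:        # descend when x is "negative"
        m -= 1
    return Fraction(m, n)

a = Fraction(3, 2)   # assumed unit element, f(a) = 1
x = Fraction(5)
y = Fraction(-2)
for n in (1, 10, 100):
    # Successive lower approximations to f(x) and f(y)
    print(n, lower_bound(x, a, n), lower_bound(y, a, n))
```

In this assumed group the approximations increase toward $x/a$, and the lower bounds for $x$ and $y$ never add up to more than the lower bound for $x + y$, in line with the inequality used in the additivity argument.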

To prove that $f(x + y) = f(x) + f(y)$ suppose first that $m/n \in L_x$ and
$r/s \in L_y$. Then $ma \le nx$ and $ra \le sy$. Hence $sma \le snx$ and $nra \le nsy$ so
that $(ms + nr)a \le ns(x + y)$, where $nsx + nsy = ns(x + y)$ on using G4
repeatedly to get $mx + my = x + y + x + y + \cdots + x + y$. Therefore
$(ms + nr)/ns = (m/n) + (r/s)$ is in $L_{x+y}$. Similarly, if $m/n \in U_x$ and $r/s \in U_y$
then $m/n + r/s$ is in $U_{x+y}$. It follows that

$\sup L_x + \sup L_y \le \sup L_{x+y} = f(x + y) = \inf U_{x+y} \le \inf U_x + \inf U_y$

and hence that $f(x + y) = \sup L_x + \sup L_y = f(x) + f(y)$. This proves
(5.4).

To establish (5.3) suppose $e < x$. Then $a < mx$ for some positive $m$ and
hence $1/m \in L_x$ so that $f(x) > 0$. Similarly if $x < e$ then $f(-x) > 0$, and
$f(e) = 0$ by G2 and (5.4). Hence $e < x \Leftrightarrow 0 < f(x)$, which is easily seen to
imply (5.3).

The final part of the theorem, namely (5.5), is proved as follows. If $Y = \{e\}$
then $f(e) = g(e) = 0$ and every $c$ satisfies (5.5). Next, suppose that $e < x$ for
some $x \in Y$. If Case 1 above holds then, with $e < x$ and $e < y < x$ for no
$y \in Y$, $f(z) = mf(x)$ and $g(z) = mg(x)$ when $z = mx$ so that $g(z) =
[g(x)/f(x)]f(z)$ for all $z \in Y$. On the other hand suppose Case 2 holds with
$e < a$. Then, by (5.3) and (5.4), $mf(a) \le nf(x)$, $mg(a) \le ng(x)$, $sf(x) < rf(a)$
and $sg(x) \le rg(a)$ for all $m/n \in L_x$ and $r/s \in U_x$, from which it follows that
$f(x)/f(a) = g(x)/g(a)$, or $g(x) = [g(a)/f(a)]f(x)$ for all $x \in Y$. ♦


5.2  ALGEBRAIC  THEORY  FOR  n  FACTORS 

The additive-measurement theory developed by Luce and Tukey (1964) and
Luce (1966) is based on the idea that a difference in two levels of one factor
can be offset by a compensating difference in the levels of any other factor.
For example, given $x_1^0 \in X_1$ and $x_2^0, x_2^1 \in X_2$, the compensation or
“solvability” assumption says that $(x_1^0, x_2^1) \sim (x_1^1, x_2^0)$ for some $x_1^1 \in X_1$. If
$X = X_1 \times X_2$ then $(x_1^1, x_2^1) \in X$ and again by solvability
$(x_1^1, x_2^1) \sim (x_1^2, x_2^0)$ for some $x_1^2 \in X_1$. Under the cited conditions this gives
rise to the picture in Figure 5.1 where the broken curves represent indifference sets. Suppose


Figure 5.1  $X = X_1 \times X_2$.




additive utilities exist for this two-factor case and that, for points on $e$, such
as $(x_1^0, x_2^0)$, $u_1(x_1^0) + u_2(x_2^0) = 0$, and for points on $a$, such as $(x_1^1, x_2^0)$,
$u_1(x_1^1) + u_2(x_2^0) = 1$, with $(x_1^0, x_2^0) \prec (x_1^1, x_2^0)$. Then, as is easily verified, the
value of $u_1 + u_2$ for the first curve to the right of $a$ must be 2, for the next
curve $u_1 + u_2 = 3$, and so forth. Thus, if $x \prec y \Leftrightarrow u_1(x_1) + u_2(x_2) < u_1(y_1) +
u_2(y_2)$, then, for any $y \in X$ there must be a positive integer $k$ such that
$y \prec (x_1^k, x_2^0)$. Hence, under unrestricted solvability, we have a necessary
Archimedean axiom for the two-factor case. It is P3 in Theorem 5.2.
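The sequence $x_1^0, x_1^1, x_1^2, \ldots$ described in connection with Figure 5.1 can be generated numerically. The sketch below assumes, purely for illustration, that preferences on the positive quadrant are represented by $v(x_1, x_2) = x_1 x_2$; the function names are hypothetical. Each step uses solvability, implemented by bisection, to find $x_1^k$ with $(x_1^{k-1}, x_2^1) \sim (x_1^k, x_2^0)$, and then the Archimedean property is checked for one point $y$.

```python
def v(x1, x2):
    # Assumed ordinal utility; only comparisons of v-values are used below.
    return x1 * x2

def solve_x1(target, x2, lo=1e-9, hi=1e9, iters=200):
    """Solvability step: find x1 with (x1, x2) on the level set v = target."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if v(mid, x2) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def standard_sequence(x1_0, x2_0, x2_1, length):
    """x1^0, x1^1, ... with (x1^{k-1}, x2^1) ~ (x1^k, x2^0) for each k."""
    seq = [x1_0]
    for _ in range(length):
        seq.append(solve_x1(v(seq[-1], x2_1), x2_0))
    return seq

def first_k_above(y, seq, x2_0):
    """Smallest k >= 1 with y strictly below (x1^k, x2^0), or None if not reached."""
    for k in range(1, len(seq)):
        if v(*y) < v(seq[k], x2_0):
            return k
    return None

seq = standard_sequence(1.0, 1.0, 2.0, 8)
print([round(x, 6) for x in seq])
print(first_k_above((5.0, 7.0), seq, 1.0))
```

With this assumed $v$ the sequence doubles at each step, so equal steps in $u_1 = \log_2 x_1$ are marked off, and every $y$ is eventually exceeded.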

Two  Factors 

In the following Theorem P1 ($C_3$ of Theorem 4.1) and P3 are necessary
conditions for weak-order additivity when $X = X_1 \times X_2$, but unrestricted
solvability (P2) is not. Except in the trivial case when $X/\!\sim\ = \{X\}$, P2 requires
both $u_1$ and $u_2$ to be unbounded above and below. Luce (1966) shows how to
weaken P2 to avoid the unboundedness implication: see also Krantz (1967,
pp. 25-27).

THEOREM 5.2. Suppose $X = X_1 \times X_2$ and the following three conditions
hold throughout $X$:

P1. $[(x^1, x^2, x^3)\ E_3\ (y^1, y^2, y^3),\ x^j \prec y^j$ or $x^j \sim y^j$ for $j < 3] \Rightarrow$ not $x^3 \prec y^3$.

P2. $(x_1, y_1 \in X_1;\ x_2 \in X_2) \Rightarrow (x_1, x_2) \sim (y_1, y_2)$ for some $y_2 \in X_2$, and
$(x_1 \in X_1;\ x_2, y_2 \in X_2) \Rightarrow (x_1, x_2) \sim (y_1, y_2)$ for some $y_1 \in X_1$.

P3. $[(x_1^0, x_2^0) \prec (x_1^0, x_2^1),\ (x_1^{k-1}, x_2^1) \sim (x_1^k, x_2^0)$ for $k = 1, 2, \ldots;\ y \in X] \Rightarrow
y \prec (x_1^k, x_2^0)$ for some $k \in \{1, 2, \ldots\}$.

Then there are real-valued functions $u_1$ on $X_1$ and $u_2$ on $X_2$ such that

$x \prec y \Leftrightarrow u_1(x_1) + u_2(x_2) < u_1(y_1) + u_2(y_2)$, for all $x, y \in X$, (5.6)

and $u_1$ and $u_2$ satisfying (5.6) are unique up to similar positive linear
transformations.

Proof. P1 implies that $\prec$ is a weak order (asymmetric, negatively
transitive) so that $\sim$ is an equivalence. Let $X/\!\sim$ be the set of equivalence
classes of $X$ under $\sim$ and fix $(x_1^0, x_2^0) \in X$. By P2, each element in $X/\!\sim$
contains elements in $X$ of the form $(x_1, x_2^0)$, $(x_1^0, x_2)$. Define $+$ on $X/\!\sim$ as
follows:

with $a, b \in X/\!\sim$, $a + b$ is the element in $X/\!\sim$ that contains $(x_1, x_2)$
when $(x_1, x_2^0) \in a$ and $(x_1^0, x_2) \in b$. (5.7)

With $a <' b \Leftrightarrow x \prec y$ for some $x \in a$ and $y \in b$, we first verify that
$(X/\!\sim, +, <')$ is a strictly ordered commutative group. We then show that it is
Archimedean and use Theorem 5.1.
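The operation in (5.7) can be made concrete. The sketch below again assumes, only for illustration, that $\prec$ on the positive quadrant is represented by $v(x_1, x_2) = x_1 x_2$ with fixed point $(x_1^0, x_2^0) = (1, 1)$; `solve` and `plus` are hypothetical names. An indifference class is represented by any one of its points, and the sum of two classes is found through the representatives $(x_1, x_2^0)$ and $(x_1^0, x_2)$ required by (5.7).

```python
import math

def v(x1, x2):
    # Assumed ordinal utility; it is used only to test indifference.
    return x1 * x2

X0 = (1.0, 1.0)   # the fixed point (x_1^0, x_2^0)

def solve(g, target, lo=1e-6, hi=1e6, iters=200):
    """Bisection for an increasing function g with g(lo) < target < g(hi)."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if g(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def plus(a, b):
    """A point of the class a + b, where a and b are points of their classes."""
    x1 = solve(lambda t: v(t, X0[1]), v(*a))   # (x1, x_2^0) lies in class a
    x2 = solve(lambda t: v(X0[0], t), v(*b))   # (x_1^0, x2) lies in class b
    return (x1, x2)

a_pt, b_pt = (2.0, 3.0), (0.5, 4.0)
print(v(*plus(a_pt, b_pt)), v(*plus(b_pt, a_pt)))   # same class either way
print(v(*plus(a_pt, X0)), v(*a_pt))                 # the class of X0 acts as identity
```

Under this assumption the classes form the positive reals under multiplication, and $f = \log v$ is an additive order-preserving function of the kind supplied by Theorem 5.1.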



1. $+$ is well defined. By P1, $(x_1, x_2^0), (y_1, x_2^0) \in a$ and $(x_1^0, x_2), (x_1^0, y_2) \in b$
imply that $(x_1, x_2) \sim (y_1, y_2)$.

2. Commutativity, G4. By P1, $(x_1, x_2^0), (x_1^0, y_2) \in a$ and $(x_1^0, x_2), (y_1, x_2^0) \in
b \Rightarrow (x_1, x_2) \sim (y_1, y_2)$, and hence by (5.7), $a + b = b + a$.

3. Associativity, G1. By G4, $(a + b) + c = a + (b + c) \Leftrightarrow c + (a + b) =
a + (c + b)$. Let $(x_1, x_2^0) \in a$, $(x_1^0, x_2) \in b$, $(y_1, x_2^0) \in c$, $(x_1^0, y_2) \in a + b$, and
$(x_1^0, z_2) \in c + b$. Hence $(x_1^0, y_2) \sim (x_1, x_2)$ and $(y_1, x_2) \sim (x_1^0, z_2)$. Hence, by
P1, $(y_1, y_2) \sim (x_1, z_2)$, which yields $c + (a + b) = a + (c + b)$.

4. Identity, G2. Let $e$ contain $(x_1^0, x_2^0)$. By G4, $e + a = a + e$. With
$(x_1, x_2^0) \in a$, (5.7) implies $(x_1, x_2^0) \in a + e$. Hence $a = a + e$.

5. Additive Inverse, G3. Define $-a$ as that element in $X/\!\sim$ that contains
$(x_1^0, x_2)$ when $(x_1, x_2^0) \in a$ and $(x_1, x_2) \in e$. Then, by (5.7), $-a + a = e$.

6. $<'$ on $X/\!\sim$ is a strict order by Theorem 2.1. Suppose $a <' b$. With
$(x_1, x_2^0) \in a$, $(y_1, x_2^0) \in b$, and $(x_1^0, x_2) \in c$, let $z_1$, by P2, satisfy $(z_1, x_2) \sim (y_1, x_2^0)$.
With $(x_1, x_2^0) \prec (y_1, x_2^0)$ also, Theorem 2.1 yields $(x_1, x_2^0) \prec (z_1, x_2)$, which
along with $(z_1, x_2) \sim (y_1, x_2^0)$ under P1 yields not $(y_1, x_2) \prec (x_1, x_2)$ and in
fact $(x_1, x_2) \prec (y_1, x_2)$ since $(x_1, x_2) \sim (y_1, x_2)$ gives a violation of P1. Hence
$a + c <' b + c$, so that (5.2) holds.

To prove that $(X/\!\sim, +, <')$ is Archimedean, suppose ($e <' a$, $e <' b$).
With $(x_1^0, x_2^1) \in a$, let the sequence in P3 be constructed as described in
connection with Figure 5.1. Since $(x_1^0, x_2^1) \in a$ and $(x_1^1, x_2^0) \in a$, (5.7) says that
$(x_1^1, x_2^1) \in 2a$. Then, since $(x_1^2, x_2^0) \sim (x_1^1, x_2^1)$, $(x_1^2, x_2^0) \in 2a$. Using (5.7) to
continue this we see that $(x_1^k, x_2^0) \in ka$, $k = 1, 2, \ldots$. With $y \in b$, P3 says
that $y \prec (x_1^k, x_2^0)$ for some $k$, which gives $b <' ka$ for some positive integer $k$.
Hence $(X/\!\sim, +, <')$ is Archimedean.

Thus, by Theorem 5.1, there is a real-valued function $f$ on $X/\!\sim$ such that
$f(a + b) = f(a) + f(b)$ and $a <' b \Leftrightarrow f(a) < f(b)$. Defining $u_1(x_1) = f(a)$
when $(x_1, x_2^0) \in a$, and $u_2(x_2) = f(b)$ when $(x_1^0, x_2) \in b$, (5.6) follows easily.

Suppose $v_1$ on $X_1$ and $v_2$ on $X_2$ also satisfy (5.6). Defining $g$ on $X/\!\sim$ by
$g(a) = [v_1(x_1) - v_1(x_1^0)] + [v_2(x_2) - v_2(x_2^0)]$ when $(x_1, x_2) \in a$, it follows that,
taking $(x_1, x_2^0) \in a$ and $(x_1^0, x_2) \in b$ so that $(x_1, x_2) \in a + b$, $g(a) + g(b) =
[v_1(x_1) - v_1(x_1^0)] + 0 + 0 + [v_2(x_2) - v_2(x_2^0)] = g(a + b)$. Moreover, from
this and (5.6), $g(a) < g(b) \Leftrightarrow a <' b$. Hence, by Theorem 5.1, $g = cf$ for
some positive number $c$. It follows that, taking $(x_1, x_2^0)$, $v_1(x_1) - v_1(x_1^0) =
cu_1(x_1)$, or $v_1(x_1) = cu_1(x_1) + v_1(x_1^0)$ for all $x_1 \in X_1$. Similarly, $v_2(x_2) =
cu_2(x_2) + v_2(x_2^0)$ for all $x_2 \in X_2$. ♦

Three  or  More  Factors 

We now consider a version of Luce’s theory (1966) for more than two
factors. As pointed out to me by David Krantz (correspondence), the
independence condition $C_3$ can be replaced by $C_2$ in this case. This necessitates
of course the explicit assumption that $\prec$ is a weak order (or negatively
transitive) since weak order does not follow from $C_2$, or P1* as we call it
below.


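THEOREM 5.3. Suppose $X = \prod_{i=1}^n X_i$, $n \ge 3$, $\prec$ on $X$ is a weak order,
and the following hold throughout $X$:

P1*. $[(x, z)\ E_2\ (y, w),\ x \prec y$ or $x \sim y] \Rightarrow$ not $z \prec w$.

P2*. $[i \in \{1, \ldots, n\},\ x \in X$ and $y_j \in X_j$ for all $j \ne i] \Rightarrow
x \sim (y_1, \ldots, y_{i-1}, z_i, y_{i+1}, \ldots, y_n)$ for some $z_i \in X_i$.

P3*. $[(x_1^0, x_2^0, \ldots, x_n^0) \prec (x_1^0, x_2^1, \ldots, x_n^1),\ (x_1^{k-1}, x_2^1, \ldots, x_n^1) \sim (x_1^k, x_2^0, \ldots, x_n^0)$
for $k = 1, 2, \ldots;\ y \in X] \Rightarrow y \prec (x_1^k, x_2^0, \ldots, x_n^0)$ for some $k \in \{1, 2, \ldots\}$.

Then there are real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ respectively
such that

$x \prec y \Leftrightarrow \sum_{i=1}^n u_i(x_i) < \sum_{i=1}^n u_i(y_i)$, for all $x, y \in X$, (5.8)

and $u_1, \ldots, u_n$ satisfying (5.8) are unique up to similar positive linear
transformations.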


Proof. Our major task will be to show that $C_3$ or P1 for $n \ge 3$ follows
from the stated hypotheses. We delay this until later, assuming for the
moment that $C_3$ or P1 holds. Fix $(x_1^0, \ldots, x_n^0)$. By P2* any $a \in X/\!\sim$ contains
elements of the form ($x_i$ for $i \in I$, $x_i^0$ for $i \notin I$) for any nonempty proper subset
$I \subset \{1, \ldots, n\}$. Define $+$ on $X/\!\sim$ as follows:

$a + b$ is the element in $X/\!\sim$ that contains $(x_1, \ldots, x_n)$ when, for any
nonempty $I \subset \{1, \ldots, n\}$, ($x_i$ for $i \in I$, $x_i^0$ for $i \notin I$) $\in a$ and ($x_i^0$ for
$i \in I$, $x_i$ for $i \notin I$) $\in b$.

To show that $+$ is well defined suppose ($x_i$ for $i \in I$, $x_i^0$ for $i \notin I$) $\in a$, ($x_i^0$ for
$i \in I$, $x_i$ for $i \notin I$) $\in b$, ($y_i$ for $i \in I^*$, $x_i^0$ for $i \notin I^*$) $\in a$, and ($x_i^0$ for $i \in I^*$,
$y_i$ for $i \notin I^*$) $\in b$, when $I$ and $I^*$ are any two nonempty proper subsets of
$\{1, \ldots, n\}$. We need to prove that $(x_1, \ldots, x_n) \sim (y_1, \ldots, y_n)$, and this is
easily seen to follow from P1.

By analogy with the preceding proof (let $X_2$ there represent $X_2 \times \cdots \times X_n$
here) it follows from P1 and the hypotheses of Theorem 5.3 that $(X/\!\sim, +,
<')$ is an Archimedean simply ordered (commutative) group. With $f$ as in
(5.3) and (5.4) and $(x_1^0, \ldots, x_n^0) \in e$, define $u_i(x_i) = f(a)$ when $(x_1^0, \ldots, x_{i-1}^0,
x_i, x_{i+1}^0, \ldots, x_n^0) \in a$, and define $u(x) = f(a)$ when $x \in a$. Then $u(x) < u(y) \Leftrightarrow
f(a) < f(b)$ so that $x \prec y \Leftrightarrow u(x) < u(y)$. Moreover, with $(x)^\sim$ the element
in $X/\!\sim$ that contains $x$, it follows from successive uses of our definition for $+$
that

$(x_1, \ldots, x_n)^\sim = (x_1, x_2^0, \ldots, x_n^0)^\sim + (x_1^0, x_2, \ldots, x_n)^\sim$
$= (x_1, x_2^0, \ldots, x_n^0)^\sim + [(x_1^0, x_2, x_3^0, \ldots, x_n^0)^\sim + (x_1^0, x_2^0, x_3, \ldots, x_n)^\sim]$
$= \cdots = (x_1, x_2^0, \ldots, x_n^0)^\sim + (x_1^0, x_2, x_3^0, \ldots, x_n^0)^\sim + \cdots + (x_1^0, \ldots, x_{n-1}^0, x_n)^\sim$,

from which we obtain $u(x) = u_1(x_1) + u_2(x_2) + \cdots + u_n(x_n)$. The proof of uniqueness
follows from Theorem 5.1 as in the preceding proof.

Proof of P1. Let $X = \prod_{i=1}^n X_i$, $n \ge 3$, and assume that P1* and P2*
hold and that $\prec$ is a weak order. To verify P1 we begin with the following
general form: show that $(x_1, x_2, \ldots, x_6) \preceq (y_1, y_2, y_3, y_4, x_5, x_6)$ when

(xH  *2>  ZH  2s.  2«)  <  (*/i,  2a.  Vs>  «4>  *5>  2s)  (5.9) 

(2i»  Hi  x3>  xa  hi  h)  “4  (2i)  2/2j  Vi<  2s*  U)-  (5,10) 

This includes all possible placements of $x_i$, $y_i$, etc. in the two given statements.
It should be understood that some dimensions may be collected into a single
$i$ in (5.9) and (5.10) and that one or more of the $i$ patterns in (5.9) or (5.10)
may be absent in a specific case.

Suppose first that the first dimension ($i = 1$) in (5.9) and (5.10) is actually
present. Using P2* let $s_1$ satisfy (omitting parentheses and commas)
^iHn~ihz6 r^1  xix3z3HziH'  Then,  by  PI*,  zxx3XgXfZaza  SinxuxshZe-  Also, 
since  WsVa2*  <  yiWsHhn  by  (5.9),  PI*  implies  <  ViVtiWAft' 

Also,  by  (5.10)  and  PI*,  <  Siy^HUsHH-  Hence,  by  transitivity, 

^  yaHy&iHHi  so  that  by  PI*  x1x2x3x4x4xs  <  yxV^i^t- 
The key to this proof was that the same element ($z_1$) appeared in the first
position on each side of (5.10). A similar proof holds if either the fourth
dimension is present ($z_4$ on both sides of (5.9)) or the sixth dimension ($z_6$)
is present. Assume henceforth that none of these three dimensions is actually
present. Renumbering subscripts, (5.9) and (5.10) then reduce to

(HiVa  h)  (5.H) 

(zij  ^a)  ^  (Vii  zti  zs)-  (5.12) 

We  are  to  show  that  ^  yx y&z.  Assuming  that  the  third  dimension  is 
present  let  sz  satisfy  xlz^ti  ~  z^,.  Then,  by  (5.1 1)  and  PI*,  yxzzs3  <  yiylt3. 
Also,  x1zaz3  ~  and  (5.12)  satisfy  the  condition  in  the  preceding  proof 
and  can  conform  to  Es  so  that,  by  PI  for  this  case  x^t,  <  yxz&3.  Then,  by 
transitivity,  xlxit3  <  y&zt3  s°  that  xxx^t3  <  Viy%x%  by  PI*. 

Finally, suppose that the third dimension in the preceding paragraph (fifth
dimension originally) is not present. This leaves us with only two patterns.
But $n \ge 3$. Therefore, we have a case like

(••■ij  z2,  2a)  ^  (zi>  y»i  y3)  (5.13) 

(H,  xs)<  (yu  Hi  h)  (5.14) 

from  which  we  are  to  show  that  xjXjX3  ^  Let  s3  satisfy  XjZgZ a  ~  z^s- 

By  (5.13)  and  PI*,  yiV1*  ^  ViUtP *■  Also,  x^jZj ~ zxz,j8  and  (5.14)  satisfy 




the  previous  pattern  for  which  jPI  holds  (z4  on  both  sides  of  the  ^  statement), 
so  that  by  PI  for  this  case,  ^  yizzs3.  Then,  by  transitivity,  XjXgX3  ^ 
JWa-  ♦ 

5.3 TOPOLOGICAL PRELIMINARIES

To  obtain  a  sound  understanding  of  Debreu’s  (1960)  additivity  theory  a 
review  of  some  theory  of  topology  is  in  order.  Familiarity  with  Section  3.4  is 
assumed. 

A topological space $(X, \mathcal{T})$ is connected if and only if $X$ cannot be
partitioned into two nonempty open sets (in $\mathcal{T}$). The closure $\bar{A}$ of $A \subseteq X$ is the
set of all $y \in X$ for which every open set that contains $y$ has a nonempty
intersection with $A$:

$\bar{A} = \{y : y \in X$ and $(y \in B, B \in \mathcal{T}) \Rightarrow A \cap B \ne \emptyset\}$. (5.15)

$(X, \mathcal{T})$ is separable if and only if $X$ includes a countable subset whose closure
is $X$. $(Re, \mathcal{U})$ is separable (as well as connected) since $Re$ is the closure of the
set of all rational numbers.
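For a finite space the closure in (5.15) can be computed directly. The following sketch uses a small three-point space with a nested topology chosen only for illustration; `closure` is a hypothetical helper name.

```python
def closure(A, X, topology):
    """Closure per (5.15): points y all of whose open neighborhoods meet A."""
    A = frozenset(A)
    return frozenset(
        y for y in X
        if all(A & B for B in topology if y in B)
    )

X = frozenset({1, 2, 3})
# Assumed example topology: the nested open sets {}, {1}, {1, 2}, X
T = [frozenset(), frozenset({1}), frozenset({1, 2}), X]

print(sorted(closure({1}, X, T)))
print(sorted(closure({3}, X, T)))
```

In this example the one-point set $\{1\}$ has closure $X$, so it plays the role of the countable dense subset in the definition of separability.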

The  following  is  Debreu’s  Proposition  4  (1964,  p.  291). 

LEMMA 5.1. Suppose $\prec$ on $X$ is a weak order, $(X, \mathcal{T})$ is a connected and
separable topological space, and $\{x : x \in X, x \prec y\} \in \mathcal{T}$ and $\{x : x \in X, y \prec x\} \in
\mathcal{T}$ for every $y \in X$. Then there is a real-valued function $u$ on $X$ that is continuous
in the topology $\mathcal{T}$ and satisfies

$x \prec y \Leftrightarrow u(x) < u(y)$, for all $x, y \in X$. (5.16)

Proof. By separability, $X$ includes a countable subset $A$ with $\bar{A} = X$. If
$x \prec z$ then $\{y : y \prec z\}$ and $\{y : x \prec y\}$ are nonempty intersecting (by
connectedness) open sets with intersection $\{y : x \prec y \prec z\} \in \mathcal{T}$. Then, by $\bar{A} = X$
and (5.15), $\{y : x \prec y \prec z\} \cap A \ne \emptyset$. Hence $A$ is $\prec$-order dense in $X$.
Theorems 3.1 and 3.5 complete the proof. ♦

Lemma 5.1 is used in the next section. Lemma 5.2, based on the following
definition, is used later in this section. Given a topological space $(X, \mathcal{T})$,
$Y \subseteq X$ is connected if and only if ($Y \cap A \ne \emptyset$, $Y \cap B \ne \emptyset$, $Y \subseteq A \cup B$,
$Y \cap A \cap B = \emptyset$) is false for every $A, B \in \mathcal{T}$.
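The definition of a connected subset translates into a direct check when the topology is finite. The sketch below is illustrative only; the two small topologies and the name `is_connected` are assumptions made for the example.

```python
def is_connected(Y, topology):
    """Y is connected iff no open A, B both meet Y, cover Y, and are disjoint on Y."""
    Y = frozenset(Y)
    for A in topology:
        for B in topology:
            if (Y & A) and (Y & B) and Y <= (A | B) and not (Y & A & B):
                return False
    return True

# Assumed example 1: nested topology on {1, 2, 3}
nested = [frozenset(), frozenset({1}), frozenset({1, 2}), frozenset({1, 2, 3})]
# Assumed example 2: discrete topology on {1, 2}
discrete = [frozenset(), frozenset({1}), frozenset({2}), frozenset({1, 2})]

print(is_connected({1, 3}, nested))
print(is_connected({1, 2}, discrete))
```

In the nested topology any two open sets that meet $\{1, 3\}$ both contain the point 1, so no separation exists; in the discrete topology $\{1\}$ and $\{2\}$ separate $\{1, 2\}$.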

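LEMMA 5.2. If $A \subseteq X$ is connected for each $A \in \mathcal{A}$ and if $A \cap A^* \ne \emptyset$
when $A, A^* \in \mathcal{A}$ then $\bigcup_{\mathcal{A}} A$ is connected.

Proof. Suppose $Y = \bigcup_{\mathcal{A}} A$ is not connected. Then ($Y \cap B \ne \emptyset$,
$Y \cap C \ne \emptyset$, $Y \subseteq B \cup C$, $Y \cap B \cap C = \emptyset$) for some $B, C \in \mathcal{T}$. Let
$A, A^* \in \mathcal{A}$ satisfy ($A \cap B \ne \emptyset$, $A^* \cap C \ne \emptyset$). Using $B$ and $C$ it follows
that $A \cup A^*$ is not connected and that, since each of $A$ and $A^*$ is connected
by hypothesis, it must be true that ($A \cap C = \emptyset$, $A^* \cap B = \emptyset$). But then
$A = A \cap B$ and $A^* = A^* \cap C$, so that $A \cap A^* = (A \cap B) \cap (A^* \cap C) \subseteq
Y \cap B \cap C = \emptyset$, which contradicts our second hypothesis. ♦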

Product  Topologies 

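If $X = \prod_{i=1}^n X_i$ and $(X_i, \mathcal{T}_i)$ is a topological space for each $i$ let

$\prod_{i=1}^n \mathcal{T}_i = \{A : A \subseteq X$ and if $(x_1, \ldots, x_n) \in A$ then there are $A_i \in \mathcal{T}_i$ for
which $x_i \in A_i$ $(i = 1, \ldots, n)$ and $\prod_{i=1}^n A_i \subseteq A\}$. (5.17)

To show that $\prod \mathcal{T}_i$ is a topology for $X$ (Definition 3.2) we note first
that $\emptyset$ and $X$ are in $\prod \mathcal{T}_i$. Let $B$ be a union of sets in $\prod \mathcal{T}_i$, with
$x \in B$. Then $x$ is in one of these sets, say $A$, so that $x \in \prod A_i \subseteq A \subseteq B$ with
$A_i \in \mathcal{T}_i$, from which it follows that $B \in \prod \mathcal{T}_i$. For the intersection of
$A, B \in \prod \mathcal{T}_i$ with $x \in A \cap B$, take $x \in \prod A_i \subseteq A$ and $x \in \prod B_i \subseteq B$ with
$A_i, B_i \in \mathcal{T}_i$; then $A_i \cap B_i \in \mathcal{T}_i$ and $x \in \prod (A_i \cap B_i) \subseteq A \cap B$.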

Exercise  13  gives  an  equivalent  definition  of  a  product  topology. 

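LEMMA 5.3. If $(X_i, \mathcal{T}_i)$ is a connected (separable) topological space for
$i = 1, \ldots, n$ then $(\prod X_i, \prod \mathcal{T}_i)$ is connected (separable).

Proof. Separability. Let $(X_i, \mathcal{T}_i)$ be separable for each $i$ with $A_i \subseteq X_i$
countable and $\bar{A}_i = X_i$. Given $x \in X = \prod X_i$ suppose $x \in B \in \prod \mathcal{T}_i$. Then
by (5.17) there are $B_i \in \mathcal{T}_i$ such that $x_i \in B_i$ and $\prod B_i \subseteq B$. By (5.15),
$B_i \cap A_i \ne \emptyset$ for each $i$; therefore $(\prod B_i) \cap (\prod A_i) \ne \emptyset$ so that
$B \cap (\prod A_i) \ne \emptyset$. It follows that $x$ is in the closure of $\prod A_i$.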

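Connectedness. Let $(X_i, \mathcal{T}_i)$ be connected for each $i$. It is not hard to
show, using (5.17), that $X_1 \times \{x_2\} \times \cdots \times \{x_n\}$ is a connected
subset of $X$ when $(X_1, \mathcal{T}_1)$ is connected, and similarly for each other factor, so that
$(X_1 \times \{x_2\} \times \cdots \times \{x_n\}) \cup (\{y_1\} \times X_2 \times \{x_3\} \times \cdots \times \{x_n\})$
is connected by Lemma 5.2 since $(y_1, x_2, \ldots, x_n)$ is in both parts of the
union. Since these unions for the various $y_1 \in X_1$ all include
$X_1 \times \{x_2\} \times \cdots \times \{x_n\}$, Lemma 5.2 applies again.
Hence $X_1 \times X_2 \times \{x_3\} \times \cdots \times \{x_n\}$ is connected. By
induction, $X_1 \times X_2 \times \cdots \times X_n$ is connected. ♦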



Continuity 

The appropriate generalization of continuity over that given in Section 3.4
is included in the following definition.

Definition 5.2. Let $(X, \mathcal{R})$, $(Y, \mathcal{S})$, and $(Z, \mathcal{T})$ be topological spaces. If
$f$ is a function on $X$ into $Y$ then $f$ is $\mathcal{R} - \mathcal{S}$ continuous if and only if $S \in \mathcal{S} \Rightarrow
\{x : x \in X, f(x) \in S\} \in \mathcal{R}$. If $g$ is a function on $X \times Y$ into $Z$ then

1. $g$ is continuous in $X$ if and only if $(y \in Y, T \in \mathcal{T}) \Rightarrow \{x : x \in X, g(x, y) \in
T\} \in \mathcal{R}$.

2. $g$ is continuous in $Y$ if and only if $(x \in X, T \in \mathcal{T}) \Rightarrow \{y : y \in Y, g(x, y) \in
T\} \in \mathcal{S}$.
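On finite spaces Definition 5.2 reduces to checking preimages of open sets. The sketch below is a minimal illustration with assumed example spaces; `preimage`, `is_continuous`, and `is_continuous_in_first` are hypothetical names.

```python
def preimage(f, domain, target_set):
    """{x in domain : f(x) in target_set}."""
    return frozenset(x for x in domain if f(x) in target_set)

def is_continuous(f, X, R, S):
    """f on X into Y is R-S continuous iff every S-open set has an R-open preimage."""
    R = set(R)
    return all(preimage(f, X, open_set) in R for open_set in S)

def is_continuous_in_first(g, X, Y, R, T):
    """g on X x Y is continuous in X iff each section x -> g(x, y) is R-T continuous."""
    return all(is_continuous(lambda x, y=y: g(x, y), X, R, T) for y in Y)

# Assumed example spaces
X = frozenset({1, 2, 3})
R = [frozenset(), frozenset({1}), frozenset({1, 2}), X]
Y = frozenset({0, 1})
S = [frozenset(), frozenset({1}), Y]

step = lambda x: 1 if x == 1 else 0        # preimage of {1} is {1}, which is open
bad = lambda x: 1 if x == 3 else 0         # preimage of {1} is {3}, which is not open
g = lambda x, y: 1 if (x == 1 and y == 1) else 0

print(is_continuous(step, X, R, S), is_continuous(bad, X, R, S))
print(is_continuous_in_first(g, X, Y, R, S))
```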

The  following  lemma  is  used  in  the  next  section. 

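LEMMA 5.4. If $(X, \mathcal{R})$, $(Y, \mathcal{S})$, and $(Z, \mathcal{T})$ are topological spaces and $f$ on
$X \times Y$ into $Z$ is $\mathcal{R} \times \mathcal{S} - \mathcal{T}$ continuous then $f$ is continuous in $X$ and in $Y$.

Proof. Let $f$ be $\mathcal{R} \times \mathcal{S} - \mathcal{T}$ continuous, and let $b \in Y$, $T \in \mathcal{T}$. We shall
show that $\{x : x \in X, f(x, b) \in T\} \in \mathcal{R}$. For all $x \in X$ let $g(x) = x$, $h(x) = b$
and $k(x) = (g(x), h(x))$. As is easily verified, $g$ on $X$ into $X$ is $\mathcal{R} - \mathcal{R}$
continuous and $h$ on $X$ into $Y$ is $\mathcal{R} - \mathcal{S}$ continuous. To show that $k$ on $X$ into
$X \times Y$ is $\mathcal{R} - \mathcal{R} \times \mathcal{S}$ continuous let $A \ne \emptyset$, $A \in \mathcal{R} \times \mathcal{S}$. By Exercise 13,
$A$ has the form

$A = \bigcup_{w \in W} B(w) \times C(w)$

with $B(w) \in \mathcal{R}$ and $C(w) \in \mathcal{S}$ for all $w \in W$. Letting a super $-1$ denote the
inverse $[k^{-1}(A) = \{x : k(x) \in A\}]$, it follows that

$k^{-1}(A) = k^{-1}(\bigcup\, [B(w) \times C(w)])$

$= \bigcup\, k^{-1}[B(w) \times C(w)]$

$= \bigcup\, k^{-1}([B(w) \times Y] \cap [X \times C(w)])$

$= \bigcup\, (k^{-1}[B(w) \times Y] \cap k^{-1}[X \times C(w)])$

$= \bigcup\, (g^{-1}[B(w)] \cap h^{-1}[C(w)]) \in \mathcal{R}$

since $g^{-1}[B(w)] \in \mathcal{R}$ and $h^{-1}[C(w)] \in \mathcal{R}$ for every $w$.

Let $r(x) = f(k(x)) = f(x, b)$. Let $T \in \mathcal{T}$. Since $f^{-1}(T) \in \mathcal{R} \times \mathcal{S}$, and
$k^{-1}(f^{-1}(T)) \in \mathcal{R}$ by the preceding demonstration, $r^{-1}(T) = k^{-1}(f^{-1}(T))$ is in
$\mathcal{R}$. That is, $\{x : x \in X, f(x, b) \in T\} \in \mathcal{R}$, as desired. ♦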

Suppose $(X_i, \mathcal{T}_i)$ is a topological space for each $i$ and $u$ is a real-valued
function on $X = \prod X_i$ that is $\prod \mathcal{T}_i - \mathcal{U}$ continuous (continuous in the
topology $\prod \mathcal{T}_i$). Then, as a corollary of Lemma 5.4, $u$ is continuous in
$\prod_{i \in I} \mathcal{T}_i$ for every nonempty $I \subseteq \{1, 2, \ldots, n\}$. That is, for each $U \in \mathcal{U}$
and fixed $x_i^0$ for $i \notin I$, $\{x_I : x \in X, x_I \in \prod_{i \in I} X_i, x_i = x_i^0$ for all $i \notin I, u(x) \in U\} \in \prod_{i \in I} \mathcal{T}_i$.

5.4 TOPOLOGICAL THEORY FOR n FACTORS

Debreu’s (1960) topologically oriented theory for additive utility with
two factors is essentially as follows.

THEOREM 5.4. Suppose $X = X_1 \times X_2$ and the following three conditions
hold throughout $X$:

Q1. $[(x^1, x^2, x^3)\ E_3\ (y^1, y^2, y^3),\ x^j \prec y^j$ or $x^j \sim y^j$ for $j < 3] \Rightarrow$ not $x^3 \prec y^3$.

Q2. $(X_i, \mathcal{T}_i)$ is a connected and separable topological space for $i = 1, 2$.

Q3. $\{x : x \in X, x \prec y\} \in \mathcal{T}_1 \times \mathcal{T}_2$ and $\{x : x \in X, y \prec x\} \in \mathcal{T}_1 \times \mathcal{T}_2$ for every $y \in X$.

Then there are real-valued functions $u_1$ on $X_1$ and $u_2$ on $X_2$ that satisfy (5.6)
and, if $(x_1, x_2) \prec (x_1, y_2)$ and $(y_1, z_2) \prec (z_1, z_2)$ for some quartet of elements
in $X$, then $u_1$ and $u_2$ satisfying (5.6) are continuous in $\mathcal{T}_1$ and $\mathcal{T}_2$ respectively
and are unique up to similar positive linear transformations.

The obvious difference between this and Theorem 5.2 is in the Q2 and
Q3 conditions (Q1 = P1). Debreu ties the space together with topological
conditions, whereas Luce and Tukey use solvability. The need for the quartet
condition in Theorem 5.4 stems from the fact that under Q1, Q2, and Q3 it
is possible to have, say, $u_1$ constant on $X_1$ and $u_2$ nonconstant on $X_2$, in which
case additive utilities are not unique up to similar positive linear
transformations and $u_2$ need not be continuous. With no loss in generality we
assume in what follows that $(x_1, x_2) \prec (x_1, y_2)$ and $(y_1, z_2) \prec (z_1, z_2)$ for some
quartet of elements in $X$.

The most obvious application of Theorem 5.4 arises when $X_1$ and $X_2$ are
intervals in $Re$. In fact, Part I of the two-part proof of the theorem assumes
that $X_1 \times X_2$ is a rectangular subset of $Re^2$. Part II then shows how the
general case can be transformed into the plane. Because Part I, which involves
ideas of Thomsen (1927) and Blaschke (1928) for what Debreu calls the
Thomsen-Blaschke theorem, goes through many steps and is rather long, I
shall not detail every step.

Proof. Part I. Throughout we assume that the hypotheses of the theorem
hold, that $X_1$ and $X_2$ are nondegenerate intervals of real numbers, that $\mathcal{T}_1$
and $\mathcal{T}_2$ are the relative usual topologies on $X_1$ and $X_2$, and that

$x < y \Rightarrow x \prec y$ (5.18)

as in Theorem 3.3, condition 2.




Figure 5.2

1. By Lemmas 5.3 and 5.1 there is a real-valued function $v$ on $X$ that is
continuous in $\mathcal{T}_1 \times \mathcal{T}_2$ and satisfies $x \prec y \Leftrightarrow v(x) < v(y)$. Then, by
Exercises 3.22 and 3.10,

$(x \prec y,\ y \prec z,\ x < z) \Rightarrow y \sim \alpha x + (1 - \alpha)z$ for a unique $\alpha \in (0, 1)$. (5.19)

2. Let $[a, b] \times [c, d]$ be a rectangular subset of $X_1 \times X_2$ for which
$(b, c) \sim (a, d)$. (Figure 5.2.) From (5.18) and (5.19) it follows that there is a
real-valued function $f$ on $[a, b]$ onto $[c, d]$ that is one-to-one with

$(x_1, f(x_1)) \sim (b, c)$ for every $x_1 \in [a, b]$.

Since $f$ strictly decreases as $x_1$ increases, it and its inverse $f^{-1}$ are continuous.

3. Our immediate goal is to show that additive utilities satisfying

$u(x_1, x_2) = u_1(x_1) + u_2(x_2)$, (5.20)

exist on $[a, b] \times [c, d]$ with $u$ a monotonic transformation of $v$ in step 1.
First, set $u_1(a) = u_2(c) = 0$ and $u_1(b) = u_2(d) = 1$. Then $u(a, c) = 0$,
$u(x_1, x_2) = 1$ for all $(x_1, x_2) \in f$, and $u(b, d) = 2$. As shown in Figure 5.2
there is a $z_1 \in (a, b)$ such that $(z_1, c) \sim (a, f(z_1))$. To prove this note first that
since $v$ is continuous it is continuous in $X_1$ and $X_2$ by Lemma 5.4. Then, by
Exercise 3.16, $\{v(x_1, c) : x_1 \in [a, b]\}$ is an interval in $Re$. Likewise,
$\{v(a, f(x_1)) : x_1 \in [a, b]\} = \{v(a, x_2) : x_2 \in [c, d]\}$ is an interval. Since $v(x_1, c)$
increases in $x_1$ and $v(a, f(x_1))$ decreases in $x_1$, there is a unique $z_1 \in (a, b)$ for
which $v(z_1, c) = v(a, f(z_1))$, so that $(z_1, c) \sim (a, f(z_1))$.

Let $g$ be the continuous indifference curve through $(z_1, c)$. To satisfy (5.20)
we must have $u_1(z_1) = u_2(f(z_1)) = \frac{1}{2}$ and $u(x_1, x_2) = \frac{1}{2}$ for every $(x_1, x_2) \in g$.
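The point $z_1$ of step 3 can be located by bisection, since $v(x_1, c)$ increases and $v(a, f(x_1))$ decreases in $x_1$. The sketch below assumes, only as an example, $v(x_1, x_2) = x_1 x_2$ on the rectangle $[1, 2] \times [1, 2]$, for which $(b, c) \sim (a, d)$; the closed form used for $f$ is specific to that assumed $v$, and `find_z1` is a hypothetical name.

```python
import math

def v(x1, x2):
    # Assumed continuous order-preserving function on the rectangle.
    return x1 * x2

a, b, c, d = 1.0, 2.0, 1.0, 2.0   # (b, c) ~ (a, d): both have v = 2

def f(x1):
    """Second coordinate of the point on the curve through (b, c) above x1."""
    return v(b, c) / x1            # closed form valid only for this assumed v

def find_z1(iters=100):
    """Bisection for the z1 in (a, b) with v(z1, c) = v(a, f(z1))."""
    lo, hi = a, b
    for _ in range(iters):
        mid = (lo + hi) / 2
        if v(mid, c) < v(a, f(mid)):
            lo = mid               # (mid, c) is still below (a, f(mid))
        else:
            hi = mid
    return (lo + hi) / 2

z1 = find_z1()
print(z1, math.log2(z1), math.log2(f(z1)))
```

For this example the additive utilities normalized as in step 3 are $u_1 = \log_2 x_1$ and $u_2 = \log_2 x_2$, and both take the value $\frac{1}{2}$ at the computed point, as (5.20) requires.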




Figure  5.3 

Q1 implies that $e \sim e'$ as shown in Figure 5.2, with $u(x_1, x_2) = \frac{3}{2}$ for every
$(x_1, x_2) \in g'$.

For reasons like those given above there is a point $(x_1, f(y_1)) = P^*$ in $g$ for
which $(x_1, c) \sim (a, f(y_1))$. As shown in Figure 5.3, the constructions from
$P^*$ define two new curves $h$ and $k$. As is easily seen from Q1, $Q \sim Q'$, $Q \sim S'$,
and $Q' \sim S$ so that $Q$, $Q'$, $S$, and $S'$ do indeed lie on the same indifference
curve ($k$). For (5.20) we must set $u_1(x_1) = u_2(f(y_1)) = \frac{1}{4}$ and $u_1(y_1) =
u_2(f(x_1)) = \frac{3}{4}$ with $u = \frac{1}{4}$ for $h$ and $u = \frac{3}{4}$ for $k$. Similar constructions (from
$g'$ in Figure 5.2) hold above $f$ on Figure 5.3.

4. The process of generating indifference curves in $[a, b] \times [c, d]$ is
repeated ad infinitum and yields a continuous indifference curve for each
value of $u$ in $\{m/2^n : 1 \le m \le 2^n, n = 1, 2, \ldots\} \cup \{1 + m/2^n : 0 \le m \le
2^n - 1, n = 1, 2, \ldots\}$. If $(x_1, x_2)$ and $(y_1, y_2)$ are on these curves then
$(x_1, x_2) \prec (y_1, y_2) \Leftrightarrow u(x_1, x_2) < u(y_1, y_2)$.

In addition, we have a set $A$ of $x_1$ points in $[a, b]$ whose set of $u_1$ values is
$\{m/2^n : 0 \le m \le 2^n, n = 1, 2, \ldots\}$ and a set $B$ of $x_2$ points in $[c, d]$ whose
set of $u_2$ values is $\{m/2^n : 0 \le m \le 2^n, n = 1, 2, \ldots\}$ with $u(x_1, x_2) =
u_1(x_1) + u_2(x_2)$ whenever $(x_1, x_2) \in A \times B$.

5. $\bar{A} = [a, b]$ and $\bar{B} = [c, d]$. (We leave this closure proof to the reader.)
It follows that

$\sup \{u_1(y_1) : y_1 \le x_1, y_1 \in A\} = \inf \{u_1(z_1) : x_1 \le z_1, z_1 \in A\}$ (5.21)

for each $x_1 \in [a, b]$. Extending $u_1$ on $A$ to $u_1$ on $[a, b]$ by defining $u_1(x_1)$ as the
common value in (5.21), it follows easily that $u_1(x_1) < u_1(y_1) \Leftrightarrow x_1 < y_1$ and
that $u_1$ on $[a, b]$ is continuous. It is clear also that once $u_1(a)$ and $u_1(b)$ are
specified, the rest of $u_1$ on $[a, b]$ is uniquely determined and $u_1$ on $[a, b]$ must
be continuous.
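The extension in (5.21) can be seen numerically by bracketing $u_1(x_1)$ between dyadic values. The sketch below assumes, for illustration only, that the dyadic points of $A$ in $[1, 2]$ are $x_1 = 2^{m/2^n}$ with $u_1 = m/2^n$, which corresponds to $u_1 = \log_2 x_1$; `dyadic_points` and `bracket` are hypothetical names.

```python
import math

def dyadic_points(n):
    """Pairs (x1, u1) with u1 = m / 2**n; here x1 = 2**u1, an assumed example."""
    return [(2.0 ** (m / 2**n), m / 2**n) for m in range(2**n + 1)]

def bracket(x1, n):
    """Level-n approximations to the sup and inf in (5.21) for x1 in [1, 2]."""
    pts = dyadic_points(n)
    lower = max(u for x, u in pts if x <= x1)   # best dyadic value from below
    upper = min(u for x, u in pts if x >= x1)   # best dyadic value from above
    return lower, upper

for n in (2, 4, 8):
    print(n, bracket(1.5, n))
```

The bracket has width at most $1/2^n$, so the lower and upper values converge to a common limit, which is the extended value $u_1(x_1)$.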

Similar remarks hold for $u_2$ on $[c, d]$, and, if $u_1$ and $u_2$ satisfy (5.20), they
are unique up to similar positive linear transformations.

To verify additivity on $[a, b] \times [c, d]$ suppose first that $(x_1, x_2) \sim (y_1, y_2)$,
both points being in $[a, b] \times [c, d]$. Since additivity holds on $A \times B$, we have
from (5.21) and its companion for $B$ that $u_1(x_1) + u_2(x_2) = u_1(y_1) + u_2(y_2)$.
On the other hand suppose that $(x_1, x_2) \prec (y_1, y_2)$. Then, as is easily seen
there must be a point $(z_1, z_2) \sim (x_1, x_2)$ for which $(z_1, z_2) < (y_1, y_2)$ so that
$u_1(z_1) + u_2(z_2) < u_1(y_1) + u_2(y_2)$ and hence $u_1(x_1) + u_2(x_2) < u_1(y_1) + u_2(y_2)$.

6. We now show that $u_1$ and $u_2$ can be extended in one and only one way
to all of $X_1$ and $X_2$ to satisfy additivity. Beginning with $[a, b] \times [c, d]$ we
first extend the horizontal lines through $(a, c)$ and $(b, c)$ and through $(a, d)$
and $(b, d)$, and likewise for the two vertical lines. The indifference curves
through $(a, c)$, $(b, c)$, and $(b, d)$ are extended also. The procedure described
in connection with Figure 5.1 is then used to generate additional indifference
curves that must have $u$ values of 2, 3, 4, ..., and $-1$, $-2$, ..., this process
continuing indefinitely or until the border(s) of $X$ (if any) are reached. This
provides us with a grid pattern on $X_1 \times X_2$ of rectangles similar to $[a, b] \times
[c, d]$, except that some of these will be truncated if $X$ is bounded. Using Q1
it is easy to verify that (except at the boundary) the lower right corner of any
rectangle is indifferent to its upper left corner.

7. We need to show that these rectangles (including truncated ones at the
boundaries, if any) actually cover $X_1 \times X_2$. For this it will suffice to show that
every $x_1 > b$ lies beneath an indifference curve generated in the manner of
Figure 5.1. To the contrary suppose, as in Figure 5.4, that $y_1 \in X_1$ does not
satisfy this condition. Let $z_1 = \sup \{x_1^j : j = 0, 1, \ldots\}$ as shown on the
figure. The continuity of $v$ then implies that $v(z_1, c) = \sup \{v(x_1^j, c) : j =
0, 1, \ldots\}$ and $v(z_1, d) = \sup \{v(x_1^j, d) : j = 0, 1, \ldots\}$ so that $v(z_1, c) = v(z_1, d)$
and hence $(z_1, c) \sim (z_1, d)$, which contradicts (5.18).



8. For additivity it is clear that $u_1(x_1^j) = j$ for each such point on the $x_1$
axis of Figure 5.4. Suppose $w_1 \in X_1$: for definiteness we assume $w_1 > b$ as
shown on Figure 5.4. By the construction shown for $w_1$, additivity requires
$u_1(w_1) + u_2(w_2) = 3$. But $u_2(w_2)$ is already known since $w_2 \in [c, d]$. Hence
$u_1(w_1)$ is uniquely determined. Similar remarks hold if $w_1 < a$, and, by
symmetry, for points in $X_2$ not in $[c, d]$. Thus, given additive $u_1$ and $u_2$ on
$[a, b]$ and $[c, d]$, $u_1$ and $u_2$ are uniquely determined on all of $X_1$ and $X_2$ when
additivity is required, and they are continuous.
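The lock-step construction behind steps 6 to 8 can be illustrated numerically. The sketch below is not part of the proof: it assumes a hypothetical order-preserving function `v(x1, x2) = x1 * x2` on positive reals, with `a = c = 1` and `b = d = 2` so that $(a, d) \sim (b, c)$, and generates grid points $x_1^0 = a, x_1^1, x_1^2, \ldots$ by solving $v(x_1^{j+1}, c) = v(x_1^j, d)$ with bisection. For this particular `v` the additive utility `log2` assigns the value $j$ to the $j$th grid point.

```python
import math

def solve_increasing(fn, target, lo, hi, tol=1e-12):
    """Bisection for fn(x) = target on [lo, hi], with fn increasing."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if fn(mid) < target:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def v(x1, x2):
    # Hypothetical order-preserving function (an assumption for illustration);
    # the v of the proof is only known to be continuous and increasing.
    return x1 * x2

def standard_sequence(a, c, d, steps, upper=1e9):
    """Return [x^0, ..., x^steps] with x^0 = a and (x^{j+1}, c) ~ (x^j, d)."""
    xs = [a]
    for _ in range(steps):
        target = v(xs[-1], d)  # value of the upper left corner (x^j, d)
        xs.append(solve_increasing(lambda x: v(x, c), target, xs[-1], upper))
    return xs

xs = standard_sequence(a=1.0, c=1.0, d=2.0, steps=5)
# For this v, u1(x1) = log2(x1) and u2(x2) = log2(x2) are additive utilities.
u1 = [math.log2(x) for x in xs]
```

With these choices the grid points come out as successive powers of 2, which is why `u1` takes the values 0, 1, 2, ... on them, up to rounding.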

9. It remains to show that (5.6) holds throughout $X_1 \times X_2$. For this it will
suffice to show that $(x_1, x_2) \sim (y_1, y_2) \Rightarrow u_1(x_1) + u_2(x_2) = u_1(y_1) + u_2(y_2)$
because then all points on the same indifference curve will have a common
$u_1 + u_2$ value and, by construction, one such curve is to the left of another
if and only if the former has a smaller $u_1 + u_2$ value.

We begin this with the rectangle in the grid of step 6 that is to the immediate
right of $[a, b] \times [c, d]$. Suppose first that $x \sim y$ and these points are beneath
the $u = 2$ curve as shown on Figure 5.5. By the constructions shown in the
figure, and using Q1, $(P \sim P', z \sim x) \Rightarrow Q \sim Q'$ and $(P \sim P'', z \sim y) \Rightarrow R \sim R'$.
Then, from additivity on $[a, b] \times [c, d]$ and the definition of $u_1$
extended, it is easily shown that $u_1(x_1) + u_2(x_2) = u_1(y_1) + u_2(y_2)$. On the
other hand, if $x$ and $y$ lie above the $u = 2$ curve we have the situation shown
in Figure 5.6. Then, by construction and Q1, $(x \sim y, P \sim P') \Rightarrow Q \sim Q'$.
By the Figure 5.5 analysis additivity holds for $Q$ and $Q'$, and it readily follows
that $u_1(x_1) + u_2(x_2) = u_1(y_1) + u_2(y_2)$. By analogy, additivity holds in each
of the four rectangles that have a boundary in common with $[a, b] \times [c, d]$.
By induction, additivity holds for every rectangle (complete or truncated)
to the right or left of $[a, b] \times [c, d]$ and above or below $[a, b] \times [c, d]$.

The next step is to show that additivity holds throughout $(X_1 \times [c, d]) \cup ([a, b] \times X_2)$.
There are no unusual difficulties in this and we omit the proof.
It can then be shown that additivity holds in each of the four rectangles that
have one corner in common with $[a, b] \times [c, d]$ and then that additivity
holds on all of $(X_1 \times [c, d]) \cup ([a, b] \times X_2) \cup$ (four corner rectangles).
The systematic introduction of new rectangles completes the proof.

Figure 5.5

Proof, Part II. We now see how the general situation for Theorem 5.4
can be transformed into the structure assumed in Part I of the proof. The
hypotheses of the theorem are assumed to hold.

1. By Lemmas 5.3 and 5.1 there is a real-valued function $w$ on $X_1 \times X_2$
that is continuous in $\mathcal{T}_1 \times \mathcal{T}_2$, and satisfies

$x \prec y \Leftrightarrow w(x) < w(y)$, for all $x, y \in X$. (5.22)

With $(a, b) \in X_1 \times X_2$ fixed let $w_1(x_1) = w(x_1, b)$ and $w_2(x_2) = w(a, x_2)$ for
all $x_1 \in X_1$, $x_2 \in X_2$. By Lemma 5.4 and Exercise 3.16, $w_i$ is continuous in $\mathcal{T}_i$,
and $W_i = \{w_i(x_i) : x_i \in X_i\}$ is a nondegenerate interval in Re. Let $\mathcal{R}_i$ be the
relative usual topology on $W_i$. Each $(W_i, \mathcal{R}_i)$ is a connected and separable
topological space.

2. Let $v$ on $W_1 \times W_2$ be defined by $v(w_1(x_1), w_2(x_2)) = w(x_1, x_2)$. From
step 1 it follows that $v$ is well defined and increases in both components.
Defining $\prec^*$ on $W_1 \times W_2$ by

$(c, d) \prec^* (e, f) \Leftrightarrow v(c, d) < v(e, f)$ (5.23)

it follows from (5.22) that

$(w_1(x_1), w_2(x_2)) \prec^* (w_1(y_1), w_2(y_2)) \Leftrightarrow (x_1, x_2) \prec (y_1, y_2)$. (5.24)

Hence $\prec^*$ is a weak order and it satisfies $(c, d) < (e, f) \Rightarrow (c, d) \prec^* (e, f)$,
similar to (5.18). It remains to show that Q1 and Q3 hold for $\prec^*$ on
$W_1 \times W_2$.




3. For Q1 suppose for $W_1 \times W_2$ that $(c^1, c^2, c^3) \, E_3 \, (d^1, d^2, d^3)$ and
$(c^1 \preceq^* d^1, c^2 \preceq^* d^2)$. We need to obtain $d^3 \preceq^* c^3$. Let $(x_1^j, x_2^j)$ for $j = 1, 2, 3$
satisfy $(c_1^j, c_2^j) = (w_1(x_1^j), w_2(x_2^j))$. Define $y_i^1, y_i^2, y_i^3$ equal to $x_i^1, x_i^2, x_i^3$ according
to the permutations (for $i = 1$ then $i = 2$) that establish $(c^1, c^2, c^3) \, E_3 \, (d^1, d^2, d^3)$.
Then $(x^1, x^2, x^3) \, E_3 \, (y^1, y^2, y^3)$ and by (5.24) and Q1 for $\prec$ on
$X_1 \times X_2$, $y^3 \preceq x^3$. Hence, again by (5.24), $d^3 \preceq^* c^3$.

4. To establish Q3 for $\prec^*$ we note first that $v$ is continuous in $W_1$ and in
$W_2$. For the $W_1$ proof let $W_1(x_2) = \{w(x_1, x_2) : x_1 \in X_1\}$ for each $x_2 \in X_2$, so
that $W_1(x_2)$ is an interval for each $x_2$. By Definition 5.2 we are to show that
$\{c : c \in W_1, v(c, d) \in A\} \in \mathcal{R}_1$ when $d \in W_2$ and $A \in \mathcal{U}$. Let $x_2 \in X_2$ satisfy
$w_2(x_2) = d$. Then $\{c : c \in W_1, v(c, d) \in A\} = \{w_1(x_1) : x_1 \in X_1, w(x_1, x_2) \in A\} =
\{w(x_1, b) : x_1 \in X_1, w(x_1, x_2) \in A \cap W_1(x_2)\}$. Since $w(x_1, x_2) < w(x_1', x_2) \Leftrightarrow
w(x_1, b) < w(x_1', b)$, it follows from the continuity of $w$ that if $A \cap W_1(x_2)$
is an open interval in $W_1(x_2)$ then $\{w(x_1, b) : x_1 \in X_1, w(x_1, x_2) \in A \cap W_1(x_2)\}$
is an open interval in $W_1$ and hence that $\{c : c \in W_1, v(c, d) \in A\} \in \mathcal{R}_1$. Thus,
if $A \in \mathcal{U}$, then in general $\{c : c \in W_1, v(c, d) \in A\} \in \mathcal{R}_1$. (See Exercise 19.)
Hence $v$ is continuous in $W_1$. The proof for $W_2$ is similar.

Now suppose $(c, d) \prec^* (e, f)$. Then $v(c, d) < v(e, f)$ by (5.23). Since $v$
increases and is continuous in each component, it is easily seen that there are
intervals $R_1(c), R_1(e) \in \mathcal{R}_1$ and $R_2(d), R_2(f) \in \mathcal{R}_2$ such that $(c, d) \in R_1(c) \times
R_2(d)$, $(e, f) \in R_1(e) \times R_2(f)$, $t \prec^* (e, f)$ for all $t \in R_1(c) \times R_2(d)$ and
$(c, d) \prec^* t$ for all $t \in R_1(e) \times R_2(f)$. This is condition 2 of Theorem 3.5 in
the $\prec^*$ context. It then follows from that theorem that $\{r : r \in W_1 \times W_2,
r \prec^* t\} \in \mathcal{R}_1 \times \mathcal{R}_2$ and $\{r : r \in W_1 \times W_2, t \prec^* r\} \in \mathcal{R}_1 \times \mathcal{R}_2$ for each
$t \in W_1 \times W_2$, which is Q3 for $\prec^*$.

Thus, all the hypotheses of Part I hold for $\prec^*$ on $W_1 \times W_2$, so that there
are real-valued continuous functions $v_1$ on $W_1$ and $v_2$ on $W_2$ that satisfy
$(c, d) \prec^* (e, f) \Leftrightarrow v_1(c) + v_2(d) < v_1(e) + v_2(f)$ and are unique up to
similar positive linear transformations when $v_1 + v_2$ additivity holds.
Defining $u_i(x_i) = v_i(w_i(x_i))$ we then get $(x_1, x_2) \prec (y_1, y_2) \Leftrightarrow u_1(x_1) +
u_2(x_2) < u_1(y_1) + u_2(y_2)$. Because $w_i$ on $X_i$ is continuous and $v_i$ on $W_i$ is
continuous, $u_i$ on $X_i$ is continuous (Exercise 16). ♦

Three  or  More  Factors 

Provided that at least three factors actively influence preferences, or are
essential to use Debreu’s term, Debreu’s additivity theory with $n \geq 3$ requires
only the $m = 2$ part of condition C in Theorem 4.1. For ready comparison
with Theorems 5.4 and 5.3 we state his theorem as follows.

THEOREM 5.5. Suppose $X = \prod_{i=1}^n X_i$, $n \geq 3$, $\prec$ on $X$ is a weak order,
$x \prec y$ for some $x, y \in X$ that differ only in the $i$th components ($i = 1, \ldots, n$),
and the following hold throughout $X$:




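Q1*. $[(x, z) \, E_2 \, (y, w), \ x \prec y \text{ or } x \sim y] \Rightarrow$ not $z \prec w$.

Q2*. $(X_i, \mathcal{T}_i)$ is a connected and separable topological space for $i = 1, \ldots, n$.

Q3*. $\{x : x \in X, x \prec y\} \in \prod_{i=1}^n \mathcal{T}_i$ and $\{x : x \in X, y \prec x\} \in \prod_{i=1}^n \mathcal{T}_i$ for each $y \in X$.

Then there are real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ respectively
that satisfy (5.8), and $u_1, \ldots, u_n$ satisfying (5.8) are continuous in $\mathcal{T}_1, \ldots, \mathcal{T}_n$
respectively and are unique up to similar positive linear transformations.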

Proof, Part I. As for the preceding theorem we consider first the case
where each $X_i$ is a nondegenerate real interval, $\mathcal{T}_i$ is the relative usual
topology for $X_i$, and (5.18) holds ($x < y \Rightarrow x \prec y$) along with the other
hypotheses of Theorem 5.5.

1. For the same reasons given in Step 1, Part I of the preceding proof, and
by Lemma 5.4, there is a continuous (in $\prod_{i=1}^n \mathcal{T}_i$) $\prec$-preserving real-valued
function $v$ on $X$ that is continuous also in any combination of factors.
Moreover, (5.19) holds.

2. Following Debreu (pp. 22-24) we consider first an additive representation
for $X_1 \times X_2$. With $a_i \in X_i$ on the interior of $X_i$ for $i > 2$, let

$H = X_1 \times X_2 \times \{a_3\} \times \cdots \times \{a_n\}$

and let $\prec^0$ on $X_1 \times X_2$ be the weak order induced by the restriction of $\prec$ on
$H$. By Q1*, $\prec^0$ is independent of the particular $a_3, \ldots, a_n$ values used.
Moreover, the conditions in the first paragraph of the preceding Part I proof
apply to $\prec^0$ on $X_1 \times X_2$: Q3 follows easily from Theorem 3.5, but Q1 (the
C3 condition) is more difficult to verify.

3. Because of continuity and $x < y \Rightarrow x \prec y$, the former Part I proof
used only the indifference part of Q1 in the two forms shown in Figure 5.7.
Form I was used to establish additivity on $[a, b] \times [c, d]$; Form II was used
in extending additivity to all of $X_1 \times X_2$. In either case the $\sim$ part of Q1 says
that $(P \sim P', Q \sim Q') \Rightarrow R \sim R'$ and $(Q \sim Q', R \sim R') \Rightarrow P \sim P'$. To
show that these hold on $H$ we show first that they hold for sufficiently small
rectangles in $H$.

Figure 5.7 (Form I and Form II)

4. Let $x_1 < z_1$ and $x_2 < z_2$ with the differences $z_1 - x_1$ and $z_2 - x_2$
sufficiently small so that there will be a point $W' = (x_1, x_2, b_3, \ldots, b_n) \in X$ that
is indifferent to $W = (z_1, z_2, a_3, \ldots, a_n) \in H$. This is shown on Figure 5.8
and follows from continuity and the fact that the $a_i$ were chosen on the
interiors of the $X_i$. Let $y_i, t_i \in X_i$ for $i = 1, 2$ be such that $x_i < y_i < t_i < z_i$
and such that $Q \sim Q'$ and $P \sim P'$. Because $W \sim W'$ there is a $Q^* =
(x_1, x_2, c_3, \ldots, c_n)$ in the indifference set (hypersurface) containing $Q$ and
$Q'$. Let $T$, $T'$, $R$, and $R'$ be positioned as indicated. Then, by Q1*,
$[(Q^*, R) \, E_2 \, (Q', T), \ Q^* \sim Q'] \Rightarrow R \sim T$, $[(Q^*, R') \, E_2 \, (Q, T'), \ Q^* \sim Q] \Rightarrow
R' \sim T'$, and $[(P, T') \, E_2 \, (P', T), \ P \sim P'] \Rightarrow T \sim T'$, so that $R \sim R'$ by
transitivity. By a similar analysis (take $R \sim R'$, then position $P$, $P'$), we have

$(Q \sim Q', R \sim R') \Rightarrow P \sim P'$.




5. Suppose $P \sim P'$ and $Q \sim Q'$ as in Form I, Figure 5.7. By repeating the
procedure used for positioning the new point $P''$ in Figure 5.3 we obtain a
succession of such points and their associated indifference curves that proceed
toward the lower left corner of the rectangle that has $P$ and $P'$ (Figure 5.7)
as two corner points. Using the construction procedure of Figure 5.3 at each
step, the rectangle is divided into many small rectangles. After some sufficiently
large number of steps, the $W \sim W'$ condition of step 4 above will
apply to each $2 \times 2$ block of four small rectangles, and hence Q1 holds in
these cases. Beginning in the lower left corner and using the Q1 condition on
the $2 \times 2$ blocks, one can show that, for each small rectangle, the lower right
corner is indifferent to the upper left corner. Using transitivity, this leads to
$R \sim R'$. Similarly, if $R \sim R'$ and $Q \sim Q'$, we find (by working into the
middle from the lower left and upper right corners) that $P \sim P'$. It then
follows from the former Part I proof that additive utilities hold in the
rectangle with corners $P$ and $P'$ in Form II of Figure 5.7 and from this it
follows that Q1 holds for Form II. Thus Q1 holds in general for $H$.

6. We know that additive utilities exist for $H$. Proceeding by induction
assume that for each $i$ from 1 to $k - 1$ ($k > 2$) there is a continuous, increasing
real-valued function $u_i$ on $X_i$ such that the indifference hypersurfaces in
$\prod_{i=1}^{k-1} X_i$ (i.e., $\prod_{i=1}^{k-1} X_i \times \prod_{i=k}^{n} \{a_i\}$) are represented by
$\sum_{i=1}^{k-1} u_i(x_i) =$ constant. Following Debreu (p. 24) we extend additivity to $\prod_{i=1}^{k} X_i$.

It follows from Q1* and step 1 that $\sum_{i=1}^{k-1} u_i(x_i) = \sum_{i=1}^{k-1} u_i(y_i) \Leftrightarrow
v(x_1, \ldots, x_{k-1}, x_k, a_{k+1}, \ldots, a_n) = v(y_1, \ldots, y_{k-1}, x_k, a_{k+1}, \ldots, a_n)$. Hence, we can
define a real-valued function $f$ on $\{\sum_{i=1}^{k-1} u_i(x_i) : x_i \in X_i \text{ for } i = 1, \ldots, k - 1\} \times X_k$ by

$f(\alpha, x_k) = v(x_1, \ldots, x_k, a_{k+1}, \ldots, a_n)$ when $\sum_{i=1}^{k-1} u_i(x_i) = \alpha$ for some $x_i$.

The function $f$ increases in each component and is continuous since $v$ is continuous.

Let $\Omega = \{v(x_1, \ldots, x_k, a_{k+1}, \ldots, a_n) : x_i \in X_i \text{ for } i = 1, \ldots, k\}$, a real
interval. With $\omega \in \Omega$, the set of all $(\alpha, x_k)$ pairs that satisfy

$f(\alpha, x_k) = \omega$ (5.25)

represents an indifference hypersurface in $\prod_{i=1}^{k} X_i$. Clearly, given $(x_k, \omega) \in
X_k \times \Omega$, if (5.25) holds for some $\alpha = \sum_{i=1}^{k-1} u_i(x_i)$, this $\alpha$ is unique; we shall
call it $g(x_k, \omega)$. It follows that the $\omega$ indifference hypersurface represented by
(5.25) can be thought of also as the set of all $(x_1, \ldots, x_k)$ for which

$\sum_{i=1}^{k-1} u_i(x_i) = g(x_k, \omega)$. (5.26)

Let $G$, a subset of $X_k \times \Omega$, be the domain of definition of $g$. With $\mathcal{T}_\Omega$ the
relative usual topology for $\Omega$, the applicable topology for $G$ is
$\{G \cap A : A \in \mathcal{T}_k \times \mathcal{T}_\Omega\}$. $g$ is continuous in this topology. (See Exercise 22.)




7. Let $(a_k, \omega^0)$ be in the interior of $G$, and take $(u_1(a_1), \ldots, u_{k-1}(a_{k-1}))$
from the interior of $\{(u_1(x_1), \ldots, u_{k-1}(x_{k-1})) : \sum_{i=1}^{k-1} u_i(x_i) = g(a_k, \omega^0)\}$:

$\sum_{i=1}^{k-2} u_i(a_i) + u_{k-1}(a_{k-1}) = g(a_k, \omega^0)$. (5.27)

Next, let $(x_k, \omega) \in G$ be near enough to $(a_k, \omega^0)$ so that the operations used
with (5.28) and (5.29) are possible. Select $(c_1, \ldots, c_{k-1}) \in \prod_{i=1}^{k-1} X_i$ for which

$\sum_{i=1}^{k-2} u_i(a_i) + u_{k-1}(c_{k-1}) = g(x_k, \omega^0)$ (5.28)

$\sum_{i=1}^{k-2} u_i(c_i) + u_{k-1}(a_{k-1}) = g(a_k, \omega)$. (5.29)

By (5.27) and (5.28), $(a_1, \ldots, a_{k-2}, a_{k-1}, a_k) \sim (a_1, \ldots, a_{k-2}, c_{k-1}, x_k)$ since
both are on the $\omega^0$ indifference hypersurface. Then, by Q1*, $(c_1, \ldots, c_{k-2},
a_{k-1}, a_k) \sim (c_1, \ldots, c_{k-2}, c_{k-1}, x_k)$. Since the first of these is on the $\omega$
indifference hypersurface by (5.29), so is the latter:

$\sum_{i=1}^{k-2} u_i(c_i) + u_{k-1}(c_{k-1}) = g(x_k, \omega)$. (5.30)

Subtracting (5.27) from (5.28) and (5.29) from (5.30) we get

$g(x_k, \omega) = g(a_k, \omega) + g(x_k, \omega^0) - g(a_k, \omega^0)$. (5.31)

8. Let $V$ be a rectangle in $G$ whose sides are parallel to the axes (of $X_k$ and
$\Omega$) and which contains $(a_k, \omega^0)$ and permits the operations used on (5.28)
and (5.29) for each $(x_k, \omega) \in V$. By (5.31) and Lemma 5.4, $g$ on $V$ can be
written as the sum of increasing continuous functions of $\omega$ and $x_k$, say

$g(x_k, \omega) = h(\omega) - u_k(x_k)$. (5.32)
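One way to see why (5.31) yields a decomposition of the form (5.32) is to read the two functions off (5.31) directly. The particular choice of $h$ and $u_k$ below is an illustrative assumption, not necessarily the normalization intended in the proof; adding the same constant to both functions gives another valid pair.

```latex
\begin{align*}
h(\omega) &:= g(a_k, \omega), \qquad
u_k(x_k) := g(a_k, \omega^0) - g(x_k, \omega^0), \\
h(\omega) - u_k(x_k)
  &= g(a_k, \omega) + g(x_k, \omega^0) - g(a_k, \omega^0) \\
  &= g(x_k, \omega) \qquad \text{by (5.31)}.
\end{align*}
```

Because $f$ increases in both of its arguments, $g$ increases in $\omega$ and decreases in $x_k$, so both $h$ and $u_k$ defined this way are increasing.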

This analysis applies to each $(a_k, \omega^0)$ in the interior of $G$: each such $(a_k, \omega^0)$
will have an associated $V$ rectangle in $G$ within which $g$ can be decomposed
as in (5.32). Suppose $V \cap V' \neq \emptyset$ with

$g(x_k, \omega) = h(\omega) - u_k(x_k)$ for $(x_k, \omega) \in V$ (5.33)

$g(x_k, \omega) = h'(\omega) - u_k'(x_k)$ for $(x_k, \omega) \in V'$. (5.34)

Fix $(b, \omega^0) \in V \cap V'$ and transform $h'$ and $u_k'$ by adding constants so that

$h'(\omega^0) = h(\omega^0)$ and $u_k'(b) = u_k(b)$. (5.35)

Suppose $(x_k, \omega) \in V \cap V'$. Then, by the parallel sides condition, $(b, \omega)$ and
$(x_k, \omega^0)$ are in $V \cap V'$. Hence, using (5.33) and (5.34),

$u_k(x_k) = -g(x_k, \omega^0) + h(\omega^0)$

$u_k'(x_k) = -g(x_k, \omega^0) + h'(\omega^0)$

so that $u_k(x_k) = u_k'(x_k)$ on using (5.35). Similarly, $h(\omega) = h'(\omega)$. Thus, under
the alignment of (5.35), $(h, u_k) = (h', u_k')$ on $V \cap V'$. It follows that $h$ and $u_k$
can be defined so as to satisfy (5.32) throughout the interior of $G$. Continuity
then insures that (5.32) holds on all of $G$.

9. Substitution of (5.32) into (5.26) yields $\sum_{i=1}^{k} u_i(x_i) = h(\omega)$ as a
representation of the $\omega$ indifference hypersurface in $\prod_{i=1}^{k} X_i$. By induction, each
indifference hypersurface in $\prod_{i=1}^{n} X_i$ can be represented by $\sum_{i=1}^{n} u_i(x_i) =$
constant, with each $u_i$ continuous and increasing in $x_i$.

Proof, Part II. The proof that the general situation of Theorem 5.5 can
be transformed into the structure of Part I in this proof is similar to the
Part II proof of Theorem 5.4. ♦


5.5  SUMMARY 

Although a very general theory of additivity has been developed by
Tversky (1967), it becomes somewhat complex and difficult to interpret
when infinite sets are involved. The reader interested in a very
general theory should consult this paper.

When rather strong structural conditions, such as weak order, $X =
\prod_{i=1}^n X_i$, solvability, and so forth are assumed to hold, less general but more
easily interpreted additivity theories result. One of these, developed by Luce
and Tukey (1964) and Luce (1966), is algebraic in nature and involves the
assumption that differences in the levels of some factors can be offset (in the
preference sense) by compensating differences in the levels of other factors.
As shown by Luce (1966) it is possible to weaken this unrestricted solvability
condition and still obtain results similar to those in Section 5.2. The theory
that results from restricted solvability is very similar to the topological
additivity theory of Debreu (1960) as reviewed in Section 5.4. In all the
theories noted in this paragraph, the independence condition C3 of Theorem
4.1 is sufficient for additivity, but C2 ($m = 2$) can be used when there are
more than two factors because C3 then follows from C2 and the other
conditions. In these well-structured theories additive utilities are unique up to
similar positive linear transformations, and in Debreu’s theory each $u_i$ on
$X_i$ is continuous in the topology $\mathcal{T}_i$ associated with $X_i$.


INDEX  TO  EXERCISES 

1-3. Lexicographic orders and additivity. 4. $mx + nx = (m + n)x$. 5-6. Strictly ordered
groups. 7. Commutative group. 8. Similar positive linear transformations. 9. Unbounded
utilities. 10. Countable sets applicability. 11-12. Closure. 13. Product topology. 14-15.
Products, intersections, unions. 16-22. Continuity. 23-24. Insufficiency of C2 (P1*, Q1*)
when $n = 2$. 25. Mean-variance criterion for normal probability distributions.


Exercises 

1. For each of the following cases $X = X_1 \times X_2$ and $(x_1, x_2) \prec (y_1, y_2) \Leftrightarrow
(x_1, x_2) <_L (y_1, y_2) \Leftrightarrow x_1 < y_1$ or $(x_1 = y_1, x_2 < y_2)$. Verify the assertions made.

a. $X_1 = \{0, 1\}$, $X_2 = \{r : r$ is a rational number$\}$. Additive utilities exist.

b. $X_1 = \{r : r$ is a rational number$\}$, $X_2 = \{0, 1\}$. Additive utilities don’t exist.

c. $X_1 = X_2 = \{j : j$ is an integer$\}$. Additive utilities exist.
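For case c, one candidate pair of additive utilities can be checked numerically. The sketch below is only an illustration under assumptions that are not in the text: the functions `u1` and `u2` are hypothetical choices (an integer-valued first utility and a bounded, strictly increasing second utility with range inside the unit interval), and the check runs over a small finite grid of integers, so it is evidence rather than a proof.

```python
import math
from itertools import product

def lex_less(x, y):
    """Lexicographic order <_L on pairs: first components decide, then second."""
    return x[0] < y[0] or (x[0] == y[0] and x[1] < y[1])

def u1(j):
    # Hypothetical first-factor utility: steps of size 1.
    return float(j)

def u2(k):
    # Hypothetical second-factor utility: strictly increasing, values in (0, 1),
    # so it can never outweigh a unit step in the first factor.
    return 0.5 + math.atan(k) / math.pi

def represents(points):
    """True if x <_L y exactly when u1 + u2 is smaller at x than at y."""
    return all(
        lex_less(x, y) == (u1(x[0]) + u2(x[1]) < u1(y[0]) + u2(y[1]))
        for x in points
        for y in points
    )

# A small grid keeps the gaps between atan values far above rounding error.
grid = list(product(range(-6, 7), repeat=2))
ok = represents(grid)
```

The same bounded-utility idea suggests why case b behaves differently: there the bounded factor comes second, so every one of infinitely many rational steps in the first factor would have to exceed a fixed positive gap.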

2. (Continuation.) Even though additive utilities exist in Exercise 1c, $(Y, +, <_L)$
as defined preceding Theorem 5.1 for this case is a non-Archimedean strictly ordered
group and therefore there is no $f$ on $Y$ that satisfies (5.3) and (5.4). Discuss this
situation further.

3. Let $X = X_1 \times X_2 \times X_3$ with $X_1 = \{0, 1\}$, $X_2 = \{1, 2, \ldots\}$.

a. If $X_3 = \{0, 1\}$ prove that additive utilities do not exist when $x \prec y \Leftrightarrow x <_L y$.

b. If $X_3 =$ Re and $x \prec y \Leftrightarrow x <_L y$ show that there is a countable subset of $X$
that is $\prec$-order dense in $X$.

4. Let $(Y, +)$ be a group. Show that if $m > 0$ and $n < 0$ are integers then
$mx + nx = (m + n)x$ whenever $x \in Y$.

5. Prove that a strictly ordered group is Archimedean when (5.3) and (5.4) hold.

6. With $L_x$ and $U_x$ as defined in the proof of Theorem 5.1, prove that there is a
unique real number $g(x)$ such that $m/n \leq g(x) \leq r/s$ for all $m/n \in L_x$ and $r/s \in U_x$.

7. Let $Y = \{0, a\}$, $a \neq 0$, and define $ma = 0$ when $m$ is an even integer and
$ma = a$ when $m$ is an odd integer. Define $+$ fully so that $(Y, +)$ is a commutative
group.

8. Show that additive utilities are unique up to similar positive linear
transformations when $X = \{x_1, y_1\} \times \{x_2, y_2\}$ and $(x_1, x_2) \prec (x_1, y_2) \sim (y_1, x_2) \prec (y_1, y_2)$.
(Assume that $\prec$ is a weak order.)

9. Verify that under the hypotheses of Theorem 5.2 $u_1$ and $u_2$ in (5.6) must be
unbounded when $x \prec y$ for some $x, y \in X$.

10. When $x \prec y$ for some $x, y \in X$, can $X_1$ and $X_2$ be countable under the
hypotheses of Theorem 5.2? Is the same thing true for the hypotheses of Theorem
5.4?

11. Let $X =$ Re with the usual topology $\mathcal{U}$. Specify the closure of (a) $\{0, 1,
2, \ldots\}$; (b) $\{r : 0 < r < 1$ and $r$ is rational$\}$; (c) $\{1/n : n = 1, 2, 3, \ldots\}$; (d) $\{m/2^n :
m = 0, 1, \ldots, 2^n, \ n = 1, 2, \ldots\}$.

12. Let $X$ be all rational points in Re, with the relative usual topology $\{X \cap
A : A \in \mathcal{U}\}$. What is the closure of $X$?

13.  Let  (A'j,  "G,)  be  a  topological  space  for  i  =  1,2 and  let  "Bj  be  the 




family  of  sets  formable  by  arbitrary  unions  of  the  sets  in  A{:A(  6  'Bi  for 

i  Prove  that  JJ*  V{  **  JJ T,{. 

14.  Let  X  **  UU  A\  £  Xt  for  j  —  1 , . . . ,  m  and  /  »  1 , . . . ,  n.  Prove  that 

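15. (Continuation.) Let $X = \prod_{i=1}^n X_i$, $A_i(t) \subseteq X_i$ for all $t \in T$, where $T$ is an
arbitrary set. Verify

a. $\bigcap_{t \in T} \left( \prod_{i=1}^n A_i(t) \right) = \prod_{i=1}^n \left( \bigcap_{t \in T} A_i(t) \right)$,

b. $\bigcup_{t \in T} \left( \prod_{i=1}^n A_i(t) \right) \subseteq \prod_{i=1}^n \left( \bigcup_{t \in T} A_i(t) \right)$.

c. Show by example that $\bigcup_{T} \left( \prod A_i(t) \right)$ can be a proper subset of $\prod \left( \bigcup_{T} A_i(t) \right)$.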

16. Let $(X, \mathcal{R})$, $(Y, \mathcal{S})$, and $(Z, \mathcal{T})$ be topological spaces and suppose $f$ on $X$
into $Y$ is $\mathcal{R} - \mathcal{S}$ continuous and $g$ on $Y$ into $Z$ is $\mathcal{S} - \mathcal{T}$ continuous. Let $h(x) =
g(f(x))$ for all $x \in X$. Prove that $h$ is $\mathcal{R} - \mathcal{T}$ continuous.

17. Using the first part of the proof of Lemma 5.4 as a guide, prove that if
$(X, \mathcal{R})$, $(Y, \mathcal{S})$, and $(Z, \mathcal{T})$ are topological spaces, if $f$ on $X$ into $Y$ is $\mathcal{R} - \mathcal{S}$
continuous and if $g$ on $X$ into $Z$ is $\mathcal{R} - \mathcal{T}$ continuous, then $h$ on $X$ into $Y \times Z$,
defined by $h(x) = (f(x), g(x))$, is $\mathcal{R} - (\mathcal{S} \times \mathcal{T})$ continuous.

18. With $\mathcal{U}$ the usual topology for Re, verify that $A \in \mathcal{U}$ if and only if $A$ is the
union of open intervals in Re.

19. (Continuation.) Argue that a real function $f$ on $X$ is continuous in the topology
$\mathcal{T}$ for $X$ if and only if $A$ is an open interval in Re implies that $f^{-1}(A) = \{x : x \in X,
f(x) \in A\}$ is in $\mathcal{T}$.

20. With $(X, \mathcal{R})$ and $(Y, \mathcal{S})$ topological spaces and $f$ a function on $X$ into $Y$,
let $f(X) = \{y : y \in Y$ and $y = f(x)$ for some $x \in X\}$. Show that if $f$ is $\mathcal{R} - \mathcal{S}$
continuous, then $f$ is $\mathcal{R} - \{f(X) \cap S : S \in \mathcal{S}\}$ continuous. Thus, a continuous function
is continuous also with respect to the relative topology for its range.

21. Let $f$ be a real, strictly increasing (or strictly decreasing) function on a real
interval $[a, b]$, and suppose that the range of $f$, $f(X)$, is a real interval. Prove that
$f$ is continuous.

22. Suppose $X$, $Y$, and $Z$ are real intervals, $f$ on $X \times Y$ onto $Z$ is strictly
increasing in each variable and is continuous. For each $(y, z) \in Y \times Z$ for which
there is an $x \in X$ that satisfies $f(x, y) = z$, let $g(y, z)$ equal $x$ when $f(x, y) = z$.
Let $G \subseteq Y \times Z$ be the domain of $g$. Prove that $g$ is continuous.

23. Given $X = [1, \infty) \times [1, \infty)$ and

$u(x_1, x_2) = x_1 x_2 + x_1^{x_2}$ for each $(x_1, x_2) \in X$,

suppose $(x_1, x_2) \prec (y_1, y_2)$ if and only if $u(x_1, x_2) < u(y_1, y_2)$, for all $(x_1, x_2)$,
$(y_1, y_2) \in X$. Verify that all the hypotheses of Theorem 5.4 hold except for Q1, and
that Q1* holds for this case (i.e., C2). Do additive utilities exist in this case? Why
not?

24. (Continuation; due to David Krantz.) Given $X = (0, \infty) \times (0, \infty)$ and

$$
u(x_1, x_2) =
\begin{cases}
x_1 x_2 + x_1^{x_2} & \text{if } 1 \leq x_1, \ 1 \leq x_2 \\
x_1 (x_2 + 1) & \text{if } 0 < x_1 \leq 1 \leq x_2 \\
2 x_1 x_2 & \text{if } 0 < x_1, \ 0 < x_2 \leq 1
\end{cases}
$$

suppose $(x_1, x_2) \prec (y_1, y_2)$ if and only if $u(x_1, x_2) < u(y_1, y_2)$, for all $(x_1, x_2)$,
$(y_1, y_2) \in X$. Verify that all hypotheses of Theorems 5.2 and 5.4 hold with the
exception of P1 or Q1, and that P1 fails but P1* or Q1* holds.

25. Let $X$ be the set of all normal probability distributions on the real line. Each
such distribution is completely known when its mean $\mu$ and standard deviation
$\sigma$ ($\geq 0$) are specified, so that we can represent $X$ by the set $X'$ of all ordered pairs
$(\mu, \sigma)$ for which $\mu$ is a real number and $\sigma \geq 0$. If the hypotheses of Theorem 5.4
hold for $\prec$ on $X'$ then there are continuous real-valued functions $f$ on $(-\infty, \infty)$
and $g$ on $[0, \infty)$ such that, for every $(\mu, \sigma)$ and $(\mu^*, \sigma^*)$ in $X'$,

$(\mu, \sigma) \prec (\mu^*, \sigma^*) \Leftrightarrow f(\mu) + g(\sigma) < f(\mu^*) + g(\sigma^*)$.

If you are familiar with normal probability distributions, comment on the
reasonableness of the hypotheses (in particular Q1) in the case where each normal
distribution represents a course of action that is a gamble for amounts of money.


Chapter  6 


COMPARISON  OF  PREFERENCE 
DIFFERENCES 


All preference axioms in preceding chapters and those in Parts II and III
involve only simple preference comparisons ($\prec$). In this chapter, however,
we shall consider a “strength-of-preference” notion that involves comparisons
of preference differences. We will use a binary relation $\prec^*$ on pairs of ordered
pairs in $X \times X$.

We interpret $(x, y) \prec^* (z, w)$ to mean that the degree of preference for $x$
over $y$ is less than the degree of preference for $z$ over $w$. The “degree of
preference” for $x$ over $y$ can of course be “negative” if $y$ is preferred to $x$.

For conceptual clarity I shall use $x - y$ to denote an ordered pair $(x, y) \in
X \times X$: $x - y = (x, y)$. Thus, $x - y \prec^* z - w$ will be used in place of, and
is identical to, $(x, y) \prec^* (z, w)$. This notation suggests some conditions that
may clarify the notion of directed preference difference comparisons, such as

$x - y \prec^* z - w \Rightarrow w - z \prec^* y - x$, (6.1)

$x - y \prec^* z - w \Rightarrow x - z \prec^* y - w$. (6.2)

In our utility representations $x - y \prec^* z - w$ will be associated with
$u(x) - u(y) < u(z) - u(w)$. In distinction to this approach Suppes and
Winet (1955) work with undirected or absolute difference comparisons and
associate the preference degree between $x$ and $y$ with $|u(x) - u(y)|$. They use
also a simple preference relation ($\prec$). With directed differences $\prec$ can be
defined directly from $\prec^*$, such as

$x \prec y \Leftrightarrow x - x \prec^* y - x$, (6.3)

but at least one author, Armstrong (1939), has taken issue with this. His
idea, which is not in vogue today, was to take $\prec^*$ as a precisely “measurable”
notion so that, for example, if $x \prec y$ then, by gradual changes, one can
always find a $z$ between $x$ and $y$ so that $z - x \sim^* y - z$, and eventually
obtain $x - y \prec^* z - w \Leftrightarrow u(x) - u(y) < u(z) - u(w)$: but at the same
time he championed intransitive indifference for $\prec$, with $x \prec y$ only if the
difference $u(y) - u(x)$ exceeds a minimal positive threshold value.

6.1  “MEASURABLE”  UTILITY 

Before we look at some formal theory, other remarks should be made.
Defining

$x - y \sim^* z - w \Leftrightarrow$ (not $x - y \prec^* z - w$, not $z - w \prec^* x - y$) (6.4)

it seems clear, Armstrong (1939) and others to the contrary notwithstanding,
that there is no more (and probably less) reason to suppose that $\sim^*$ is
transitive than to suppose that $\sim$ is transitive. For example, can you find
one and only one value of $x$ for which $\$x - \$0 \sim^* \$100000 - \$x$? If you
can, I venture to say that your discriminatory judgment is rather more acute
than that of most mortals.
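By way of contrast with introspective judgment, an idealized model does pin down a single equal-difference point. The sketch below assumes a hypothetical utility for money, `u(x) = sqrt(x)`, which is our own choice and not one proposed in the text. It reads $x - y \prec^* z - w$ as $u(x) - u(y) < u(z) - u(w)$, checks conditions (6.1) and (6.2) on a small set of amounts, and computes the unique $x$ that the model places midway between \$0 and \$100000.

```python
import math
from itertools import product

def u(x):
    # Hypothetical utility of money (an assumption for illustration only).
    return math.sqrt(x)

def diff_less(x, y, z, w):
    """Model of x - y <* z - w: u(x) - u(y) < u(z) - u(w)."""
    return u(x) - u(y) < u(z) - u(w)

# Perfect squares keep sqrt exact, so the comparisons involve no rounding.
amounts = [k * k for k in range(6)]

def conditions_hold():
    """Check (6.1) and (6.2) for every quadruple of sample amounts."""
    for x, y, z, w in product(amounts, repeat=4):
        if diff_less(x, y, z, w):
            if not diff_less(w, z, y, x):  # (6.1)
                return False
            if not diff_less(x, z, y, w):  # (6.2)
                return False
    return True

def equal_difference_point(low, high):
    """Solve u(x) - u(low) = u(high) - u(x) for x, using u = sqrt."""
    return ((u(low) + u(high)) / 2) ** 2

midpoint = equal_difference_point(0, 100000)
```

Under this assumed `u` the midpoint is \$25000 up to rounding. The sharpness comes entirely from the model; it says nothing about whether a person's judgments of $\sim^*$ are that precise.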

Since its introduction by Pareto (1927, p. 16) and Frisch (1926), the idea
of comparable preference differences has been severely criticized, and for
reasons that go deeper than the discriminatory vagueness that may lead to
intransitive $\sim^*$. One charge has been that the notion has no operational
meaning. Because of this, several “operational” modes for making comparisons
have been suggested, including the following three, where we assume for
convenience that $y \prec x \prec w \prec z$.

1. To compare $x - y$ and $z - w$, compare a 50-50 gamble resulting in
either $x$ or $w$ with a 50-50 gamble resulting in either $y$ or $z$. If the former is
preferred, take $z - w \prec^* x - y$, and so forth.

2. To compare $x - y$ and $z - w$ imagine that you already have $y$ and $w$
and can either exchange $y$ for $x$ or exchange $w$ for $z$. If you prefer the former
exchange take $z - w \prec^* x - y$, and so forth.

3. Assuming that $x$, $y$, $z$, and $w$ are nonmonetary, estimate the minimum
bonus $\$a$ for which $x \sim y + \$a$, and estimate the minimum bonus $\$b$ for
which $z \sim w + \$b$. If $\$a < \$b$ take $x - y \prec^* z - w$.
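Mode 1 can be related to the utility-difference representation under one extra assumption that this chapter does not make: that the individual ranks even-chance gambles by expected utility, using the same $u$ that represents preference differences. Under that assumption only, a short calculation gives:

```latex
\begin{align*}
\tfrac12 u(x) + \tfrac12 u(w) > \tfrac12 u(y) + \tfrac12 u(z)
  &\iff u(x) + u(w) > u(y) + u(z) \\
  &\iff u(z) - u(w) < u(x) - u(y),
\end{align*}
```

which is the utility form of $z - w \prec^* x - y$. Without the expected-utility assumption the equivalence need not hold.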

Of these three we must reject the second since it violates the hypothesis
that $X$ is a set of mutually exclusive alternatives, in which case it makes little
if any sense to suppose that you already have both $y$ and $w$. The third
approach, which might appeal to some people, is suspect first for the reason
that it presupposes a form of independence between $X$ and the monetary
bonuses (as in a two-factor situation in Chapters 4 and 5) and second that,
even if independence applies, there is some question about defining a
strength-of-preference notion on the basis of simple preference comparisons.




This  last  clause  applies  also,  as  noted  by  Weldon  (1950)  and  Ellsberg 
(1954),  among  others,  to  the  50-50  gambles  device.  Simple  comparisons 
between  even-chance  gambles  as  a  basis  for  defining  degree  of  preference 
seem  to  distort  the  notion  introduced  by  Pareto  and  Frisch.  Included  in  this 
distortion  is  the  addition  of  chance,  which  plays  no  part  in  the  basic  notion. 
Along with Weldon and Ellsberg, I would have no quarrel with an individual
who judges that $\$30 - \$0 \prec^* \$100 - \$40$ but prefers an even-chance
gamble between \$30 and \$40 to one between \$0 and \$100. The latter judgment
involves the individual’s attitude toward taking chances, an attitude we feel is
not part of the $\prec^*$ notion.
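The divergence described above can be made concrete with a small numerical sketch. Both functions below are illustrative assumptions and do not come from the text: `v` stands for a hypothetical strength-of-preference scale (taken to be linear in dollars) and `u` for a hypothetical risk-averse utility governing choices among gambles (taken to be a square root).

```python
import math

def v(m):
    """Hypothetical strength-of-preference scale (assumed linear in dollars)."""
    return float(m)

def u(m):
    """Hypothetical utility governing choices among gambles (assumed risk averse)."""
    return math.sqrt(m)

y, x, w, z = 0, 30, 40, 100  # y < x < w < z, as in the text

# Direct judgment: is the 30 - 0 difference smaller than the 100 - 40 difference?
direct_judgment = v(x) - v(y) < v(z) - v(w)

# Device 1: compare the even-chance gamble on {x, w} with the one on {y, z}.
gamble_xw = 0.5 * u(x) + 0.5 * u(w)
gamble_yz = 0.5 * u(y) + 0.5 * u(z)
prefers_xw = gamble_xw > gamble_yz

# If both flags are True, device 1 would rank z - w below x - y,
# which is the opposite of the direct judgment.
print(direct_judgment, prefers_xw)
```

Under these assumed functions both flags are true, so the even-chance device and the direct comparison disagree, which is the situation the paragraph allows for.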

If  we  do  in  fact  reject  such  approaches  we  may  be  driven  back  to  the  idea 
of the early writers on this subject, that $\prec^*$ comparisons are essentially a
matter  of  direct  self-interrogation  as  to  whether  your  degree  of  preference 
for  x  over  y  exceeds,  equals,  or  is  less  than  your  degree  of  preference  for  z 
over  w.  As  noted  above,  this  is  rejected  by  some  because  of  its  “nonopera- 
tional”  character. 

Others  dislike  the  idea  of  direct  preference-difference  comparisons  for  the 
reason that, under sufficiently powerful conditions on $\prec^*$, one must logically
accept  the  ability  to  “measure”  preference  differences  introspectively  much 
as  one  would  go  about  measuring  lengths  with  a  measuring  rod.  This 
implication  of  the  “measurability”  of  utility  has  caused  much  commotion  in 
the  literature:  some  writers  who  accept  the  concept  of  simple  preference 
comparisons  find  it  impossible  to  endorse  the  notion  of  “measurable” 
utility.  Pareto,  in  fact,  denounced  the  very  notion  he  introduced  when  he 
found  that  it  was  not  needed  to  derive  certain  results  in  the  theory  of 
static,  riskless,  consumer  demand.  On  the  other  hand,  Frisch  (1964)  remains 
an  advocate  of  “measurable”  utility:  in  the  cited  paper,  on  the  subject  of 
dynamic  (time-dependent)  consumer  demand  theory,  he  points  out  that 
several  attractive  results  cannot  be  obtained  without  some  notion  of 
“measurable”  utility. 

For  some  people,  the  direct,  introspective  “measurability”  pill  may  be 
easier to swallow when intransitive $\sim^*$ is allowed to enter the theory.
Although  our  preference-difference  comparisons  may  not  be  as  precise  as 
length  comparisons  made  with  precision  instruments,  I  do  not  feel  that  this 
is  sufficient  reason  to  abandon  the  idea  of  such  comparisons. 

6.2  THEORY  WITH  FINITE  SETS 

Using the method of Adams (1965), we now state and prove two representation
theorems for preference-difference comparisons when $X$ is finite. Both
are incorporated in Theorem 6.1. The A $\Leftrightarrow$ A* theorem permits intransitive
$\sim^*$ but the B $\Leftrightarrow$ B* theorem takes $\sim^*$ as transitive. The A theorem is
proved by Adams (1965). An equivalent of the B theorem is proved by Scott
(1964).

THEOREM 6.1. Suppose $X$ is finite. Then

A. [$x^1, \ldots, x^m, w^1, \ldots, w^m$ is a permutation of $y^1, \ldots, y^m, z^1, \ldots, z^m$
and $x^j - y^j \prec^* z^j - w^j$ for all $j < m$] $\Rightarrow$ not $x^m - y^m \prec^* z^m - w^m$;

B. [$x^1, \ldots, x^m, w^1, \ldots, w^m$ is a permutation of $y^1, \ldots, y^m, z^1, \ldots, z^m$
and $x^j - y^j \prec^* z^j - w^j$ or $x^j - y^j \sim^* z^j - w^j$ for each $j < m$] $\Rightarrow$ not
$x^m - y^m \prec^* z^m - w^m$;

for all $x^j, y^j, z^j, w^j \in X$ and $m = 2, 3, \ldots$, if and only if there is a real-valued
function $u$ on $X$ such that, for all $x, y, z, w \in X$,

A*. $x - y \prec^* z - w \Rightarrow u(x) - u(y) < u(z) - u(w)$;

B*. $x - y \prec^* z - w \Leftrightarrow u(x) - u(y) < u(z) - u(w)$.

It is easily seen that A* $\Rightarrow$ A and B* $\Rightarrow$ B. A does not require $\prec^*$ to be
transitive although the transitive closure of $\prec^*$ under A is a strict partial
order. A does not imply either (6.1) or (6.2). On the other hand, B implies
that $\prec^*$ is a weak order along with (6.1) and (6.2). For asymmetry, B says
that $x - y \prec^* z - w \Rightarrow$ not $z - w \prec^* x - y$, since $x, z, w, y$ is a permutation
of $y, w, z, x$. Negative transitivity then follows from B: (not $x - y \prec^* z - w$,
not $z - w \prec^* r - s$) $\Rightarrow$ ($z - w \preceq^* x - y$, $r - s \preceq^* z - w$) $\Rightarrow$ not
$x - y \prec^* r - s$. With $\prec$ as defined in (6.3), B implies that $\prec$ on $X$ is a weak
order. Here and later, $\preceq^* = \prec^* \cup \sim^*$.
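The easy direction B* $\Rightarrow$ B can be checked by brute force on a small set. The sketch below is only an illustration under assumed data: the three-element set and the utility values are hypothetical, and only the case $m = 2$ of condition B is enumerated.

```python
from itertools import product

X = "rst"                          # hypothetical three-element set
u = {"r": 0, "s": 1, "t": 3}       # hypothetical utilities on X

def prec(x, y, z, w):
    """x - y strictly below z - w, defined from u as in B*."""
    return u[x] - u[y] < u[z] - u[w]

def sim(x, y, z, w):
    """x - y indifferent to z - w under the same u."""
    return u[x] - u[y] == u[z] - u[w]

# Condition B with m = 2: whenever x1, x2, w1, w2 is a permutation of
# y1, y2, z1, z2 and the first comparison is weak, the second cannot be strict.
violations = 0
for x1, y1, z1, w1, x2, y2, z2, w2 in product(X, repeat=8):
    if sorted([x1, x2, w1, w2]) != sorted([y1, y2, z1, z2]):
        continue
    first_weak = prec(x1, y1, z1, w1) or sim(x1, y1, z1, w1)
    if first_weak and prec(x2, y2, z2, w2):
        violations += 1

print(violations)
```

No violation can occur here because the permutation hypothesis forces the two utility differences to sum to zero, so one cannot be nonpositive while the other is negative.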

Sufficiency Proofs. Let A hold. To apply the Theorem of The Alternative
(Theorem 4.2) let $c = (u(t^1), u(t^2), \ldots, u(t^N))$ where $X = \{t^1, \ldots, t^N\}$. Let
$\mathcal{A}$ be the set of all $x - y \prec^* z - w$ statements. If $\mathcal{A} = \emptyset$, A* is immediate.
If $\mathcal{A} \neq \emptyset$, each corresponding $u(x) - u(y) < u(z) - u(w)$ translates into a
$c \cdot a^k > 0$ statement, which gives a system like (4.4). If this system has no $c$
solution then, by Theorem 4.2 and the fact that the $a_j^k \in \{-1, 0, 1\}$ for all $j$
and $k$, there are non-negative integers $r_k$ at least one of which is positive such
that $\sum_k r_k a_j^k = 0$ for $j = 1, \ldots, N$. From the original $x - y \prec^* z - w$
statements it then follows that there is a sequence $x^1 - y^1 \prec^* z^1 - w^1, \ldots,$
$x^m - y^m \prec^* z^m - w^m$ with $x^1, \ldots, x^m, w^1, \ldots, w^m$ a permutation of
$y^1, \ldots, y^m, z^1, \ldots, z^m$. If $m > 1$, this violates A. If $m = 1$, it yields
$x - y \prec^* x - y$ or else $x - x \prec^* y - y$, each of which violates A. Hence
there is a $c$ solution.

Let B hold. Axiom B implies as a special case that if (in the two-dimensional
sense) $((x^1, y^1), \ldots, (x^m, y^m)) \, E_m \, ((z^1, w^1), \ldots, (z^m, w^m))$ and if
$x^j - y^j \prec^* z^j - w^j$ or $x^j - y^j \sim^* z^j - w^j$ for each $j < m$, then not
$x^m - y^m \prec^* z^m - w^m$. It follows immediately from Theorem 4.1C that there
are real-valued functions $u_1$ and $u_2$ on $X$ such that $x - y \prec^* z - w \Leftrightarrow$
$u_1(x) + u_2(y) < u_1(z) + u_2(w)$. Also, B implies that $x - y \prec^* z - w \Leftrightarrow$
$w - z \prec^* y - x$. Hence $x - y \prec^* z - w \Leftrightarrow u_1(w) + u_2(z) < u_1(y) + u_2(x)$.
Defining $u(x) = u_1(x) - u_2(x)$ it then follows that $x - y \prec^* z - w \Leftrightarrow$
$u(x) - u(y) < u(z) - u(w)$. ♦

6.3  REVIEW  OF  INFINITE-SET  THEORIES 

In this section we review some theories that assume that $\prec^*$ on $X \times X$ is a
weak order and imply that there is a real-valued function $u$ on $X$ satisfying

$$x - y \prec^* z - w \Leftrightarrow u(x) - u(y) < u(z) - u(w), \quad \text{for all } x, y, z, w \in X \tag{6.5}$$

that is “unique up to a positive linear transformation.” This means that if $u$
satisfies (6.5) then $v$ satisfies (6.5) also if and only if there are real numbers
$a > 0$ and $b$ such that

$$v(x) = a u(x) + b, \quad \text{for all } x \in X. \tag{6.6}$$

The two-factor additivity theories of Chapter 5 can be adapted to the
present case. Suppose, for example, that there are real-valued functions $u_1$
and $u_2$ on $X$ such that

$$x - y \prec^* z - w \Leftrightarrow u_1(x) + u_2(y) < u_1(z) + u_2(w), \quad \text{for all } x, y, z, w \in X, \tag{6.7}$$

with $u_1$ and $u_2$ unique up to similar positive linear transformations. Suppose
also that (6.1) holds. Then, as in the proof of Theorem 6.1B, $u$ on $X$, defined
by $u(x) = u_1(x) - u_2(x)$, satisfies (6.5). In addition, $u$ is unique up to a
positive linear transformation. For suppose that $u$ and $v$ satisfy (6.5). Defining
$u_1(x) = u(x)$, $u_2(x) = -u(x)$, $v_1(x) = v(x)$ and $v_2(x) = -v(x)$, it follows
from (6.5) that (6.7) holds for $(u_1, u_2)$ and for $(v_1, v_2)$. Since $v_1$ is a positive
linear transformation of $u_1$, $v$ is a positive linear transformation of $u$.
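The meaning of "unique up to a positive linear transformation" in (6.6) can be illustrated on a finite example. This is a sketch with assumed data: the four labels and their utilities are hypothetical, and a finite set does not by itself force uniqueness; the point is only that a transformation $au + b$ with $a > 0$ preserves the ordering of differences, whereas a merely increasing transformation need not.

```python
from itertools import product

X = ["x", "y", "z", "w"]
u = {"x": 0, "y": 2, "z": 3, "w": 4}   # hypothetical utilities

def difference_order(util):
    """All quadruples (x, y, z, w) with util(x) - util(y) < util(z) - util(w)."""
    return {q for q in product(X, repeat=4)
            if util[q[0]] - util[q[1]] < util[q[2]] - util[q[3]]}

v = {k: 2 * val + 5 for k, val in u.items()}   # positive linear transformation
t = {k: val ** 3 for k, val in u.items()}      # increasing, but not linear

same_under_linear = difference_order(u) == difference_order(v)
same_under_cube = difference_order(u) == difference_order(t)

print(same_under_linear, same_under_cube)
```

The cubed scale keeps the simple ordering of the four points but reverses, for instance, the comparison between the $z - y$ and $y - x$ differences, so it represents a different $\prec^*$.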

From this reasoning and Theorem 5.4, the following axioms, after Debreu
(1960), imply a $u$ for (6.5) that is continuous in $\mathcal{T}$:

A1. $x - y \prec^* z - w \Rightarrow w - z \prec^* y - x$,

A2. [$((x^1, y^1), (x^2, y^2), (x^3, y^3)) \, E_3 \, ((z^1, w^1), (z^2, w^2), (z^3, w^3))$, $x^j - y^j \prec^*$
$z^j - w^j$ or $x^j - y^j \sim^* z^j - w^j$ for $j = 1, 2$] $\Rightarrow$ not $x^3 - y^3 \prec^* z^3 - w^3$,

A3. $(X, \mathcal{T})$ is a connected and separable topological space,

A4. $\{x - y : x - y \in X \times X, \; x - y \prec^* z - w\} \in \mathcal{T} \times \mathcal{T}$ and $\{x - y : x -$
$y \in X \times X, \; z - w \prec^* x - y\} \in \mathcal{T} \times \mathcal{T}$, for every $z - w \in X \times X$.

Algebraic  Axioms 

Suppes and Winet (1955), Scott and Suppes (1958), and Suppes and Zinnes
(1963, pp. 34-38) present nontopological axioms that imply a $u$ for (6.5) that
is unique up to a positive linear transformation. The first four Suppes-Zinnes
axioms are equivalent to B1 and B2:

B1. $\prec^*$ on $X \times X$ is a weak order,

B2. (6.1) and (6.2).

Their final three axioms, rather than using the complete A2, are based on
algebraic conditions. With $\prec$ on $X$ and $\sim^*$ on $X \times X$ as defined in (6.3) and
(6.4), $x - y \, M^1 \, z - w$ means that $x - y \sim^* z - w$ and $y \sim z$. (That is, the
preference interval from $y$ to $x$ “equals” the preference interval from $w$ to $z$
and the two intervals are contiguous.) Proceeding recursively, $x - y \, M^{n+1} \, z -$
$w$ means that there are $s, t \in X$ such that $x - y \, M^n \, s - t$ and $s - t \, M^1 \, z - w$.
The final three axioms are: for every $x, y, z, w \in X$,

B3. $x - s \sim^* s - y$ for some $s \in X$,

B4. ($y \prec x$, $z - w \prec^* x - y$) $\Rightarrow$ ($y \prec s \prec x$, $z - w \preceq^* x - s$) for some
$s \in X$,

B5. ($y \prec x$, $x - y \preceq^* z - w$) $\Rightarrow$ ($z - s \, M^n \, t - w$, $z - s \preceq^* x - y$) for
some $s, t \in X$ and some positive integer $n$.

B3 is the midpoint or bisection axiom, similar to Armstrong’s notion
following (6.3). In nontrivial cases, B3 requires $X$ to be infinite. B4 is like
a continuity condition, and B5 is a structural-Archimedean axiom. B5 says
that if the difference $x - y$ is “positive” then, no matter how large $z - w$
happens to be, there is an $n$ such that the $z - w$ interval can be divided into
$n + 1$ equal parts no one of which is larger than $x - y$.

Pfanzagl’s  Theory 

Pfanzagl (1959) presents axioms that, under one interpretation, imply (6.5)
with $u$ unique up to a positive linear transformation. His general theory uses
a set $X$ that is connected (topologically) and a function $f$ on $X \times X$ into $X$.
Instead of $\prec^*$ he uses $\prec$ along with $f$. However, in the interpretation of this
chapter, $\prec^*$ is not completely absent since $f(x, y)$ is interpreted as a point in
$X$ that is midway in preference between $x$ and $y$, like $s$ in B3.

In addition to a continuity axiom, Pfanzagl’s theory uses the following
assumptions:

C1. $\prec$ on $X$ is a weak order,

C2. $x \prec y \Rightarrow f(x, z) \prec f(y, z)$ and $f(z, x) \prec f(z, y)$ for every $z \in X$;
$x \sim y \Rightarrow f(x, z) \sim f(y, z)$ and $f(z, x) \sim f(z, y)$ for every $z \in X$,

C3. $f(f(x, y), f(z, w)) \sim f(f(x, z), f(y, w))$.


C3 is the bisymmetry axiom. These axioms (including continuity) imply that
there is a real-valued function $u$ on $X$ that satisfies

$$x \prec y \Leftrightarrow u(x) < u(y) \tag{6.8}$$

$$u(f(x, y)) = p u(x) + q u(y) + r \tag{6.9}$$

for all $x, y \in X$ and is unique up to a positive linear transformation.

Under the interpretation of $f$ as a midpoint function, two more axioms
arise:

C4. $f(x, x) \sim x$,

C5. $f(x, y) \sim f(y, x)$.

When $x \prec y$ for some $x, y \in X$, C4 and C5 require $p = q = \tfrac{1}{2}$ and $r = 0$ in
(6.9). It follows that $f(x, y) \prec f(z, w) \Leftrightarrow u(x) + u(y) < u(z) + u(w)$. (6.5)
then follows when $\prec^*$ on $X \times X$ is defined as follows:

$$x - y \prec^* z - w \Leftrightarrow f(x, w) \prec f(z, y). \tag{6.10}$$
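A concrete midpoint function helps in reading C3 through C5 and (6.10). The sketch below is an assumed instance, not part of Pfanzagl's theory: $X$ is taken to be the positive reals ordered by size, $u$ is the logarithm, and $f$ is the geometric mean, so that $u(f(x, y)) = \tfrac{1}{2}u(x) + \tfrac{1}{2}u(y)$.

```python
import math
from itertools import product

def u(x):
    """Assumed utility on the positive reals."""
    return math.log(x)

def f(x, y):
    """Assumed midpoint function: the geometric mean, whose log is the average log."""
    return math.sqrt(x * y)

pts = [1, 2, 3, 5, 8]   # hypothetical sample points

# C3 (bisymmetry), C4 and C5, checked up to floating-point tolerance.
c3 = all(math.isclose(f(f(x, y), f(z, w)), f(f(x, z), f(y, w)))
         for x, y, z, w in product(pts, repeat=4))
c4 = all(math.isclose(f(x, x), x) for x in pts)
c5 = all(f(x, y) == f(y, x) for x, y in product(pts, repeat=2))

# (6.10): x - y below z - w iff f(x, w) precedes f(z, y); compare with (6.5).
agrees_with_6_5 = all(
    (f(x, w) < f(z, y)) == (u(x) - u(y) < u(z) - u(w))
    for x, y, z, w in product(pts, repeat=4))

print(c3, c4, c5, agrees_with_6_5)
```

The sample points were chosen so that distinct unordered pairs have distinct products, which keeps the strict comparisons free of rounding ambiguity.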


6.4  SUMMARY 

The  notion  of  comparable  preference  differences  is  (with  the  exception  of 
Exercise  17)  the  only  strength-of-preference  or  preference  intensity  concept 
that  appears  in  this  book.  The  additive  utility  theories  of  Chapters  4  and  5, 
although  mathematically  similar  to  the  theories  in  this  chapter,  are  based 
solely  on  simple  preference  comparisons  and  involve  no  higher-order  prefer¬ 
ence  concepts. 

With $x - y \prec^* z - w$ interpreted as “your degree of preference for $z$ over
$w$ exceeds your degree of preference for $x$ over $y$” the conditions that relate
$x - y \prec^* z - w$ to $u(x) - u(y) < u(z) - u(w)$ are similar to the conditions
used in two-factor additivity theories. Exceptions to this arise in (6.1) and
(6.2), which are addressed specifically to the preference-difference notion and
have no counterparts in preceding chapters.


INDEX  TO  EXERCISES 

1-3. Even-chance gambles theory. 4-6. (6.1) and (6.2). 7-9. (6.3). 10. Condition B.
11. Semiordered preference differences. 12-13. Algebraic theory. 14-16. Pfanzagl’s
conditions. 17-18. “Twice as happy.”


Exercises 

1. Interpret $(x, y) \prec (z, w)$ to mean that a 50-50 gamble between $z, w \in X$ is
preferred to a 50-50 gamble between $x, y \in X$. Assuming that $X$ is finite, give
necessary and sufficient conditions for $\prec$ on $X \times X$ for each of the following two
utility representations: (a) $(x, y) \prec (z, w) \Rightarrow u(x) + u(y) < u(z) + u(w)$; (b)
$(x, y) \prec (z, w) \Leftrightarrow u(x) + u(y) < u(z) + u(w)$.

2. (Continuation.) Using Theorem 5.4 argue that, when A1 of Section 6.3 is
replaced by $(x, y) \sim (y, x)$ for all $x, y \in X$, and $(\prec^*, \sim^*)$ in A2, A3, and A4 is
replaced by $(\prec, \sim)$, then there is a real-valued function $u$ on $X$ that satisfies
$(x, y) \prec (z, w) \Leftrightarrow u(x) + u(y) < u(z) + u(w)$ and is continuous in $\mathcal{T}$ and unique
up to a positive linear transformation.

3. (Continuation.) Interpret $f(x, y)$ in Pfanzagl’s theory as an element in $X$ that is
indifferent to a 50-50 gamble between $x$ and $y$. Show that $(x, y) \prec (z, w) \Leftrightarrow u(x) +$
$u(y) < u(z) + u(w)$ follows from (6.8) and (6.9) when C4 and C5 are used and
$(x, y) \prec (z, w) \Leftrightarrow f(x, y) \prec f(z, w)$.

4. Prove that [$\prec^*$ is irreflexive, (6.2)] $\Rightarrow$ $x - x \sim^* y - y$.

5. Prove that [(6.1), (6.2)] $\Rightarrow$ ($x - y \sim^* z - w \Leftrightarrow x - z \sim^* y - w \Leftrightarrow w - z \sim^*$
$y - x$).

6. Prove that [$\prec^*$ is asymmetric, (6.2), $x - y \sim^* z - w \Leftrightarrow x - z \sim^* y - w$] $\Rightarrow$
(6.1).

7. Suppose that $\prec^*$ on $X \times X$ is a strict partial order, (6.2) holds, $x \prec y$, $y \prec z$,
and $x \prec z$ according to (6.3) and, with $a \approx^* b \Leftrightarrow$ ($a \sim^* c \Leftrightarrow b \sim^* c$, for all
$c \in X \times X$), $r - r \approx^* s - s$ for all $r, s \in X$. Show that: (a) $x - y \prec^* z - x$;
(b) $x - y \prec^* z - y$; (c) $x - z \prec^* y - z$; (d) $y - x \prec^* z - x$; (e) $z - y \prec^* z - x$.

8. (Continuation.) Show that [$\prec^*$ is a strict partial order, (6.1), (6.2), $x - x \approx^*$
$y - y$ for all $x, y \in X$] $\Rightarrow$ $\prec$ on $X$ as defined by (6.3) is a strict partial order.

9. Show that [$\prec^*$ is a weak order, (6.2), $x - y \sim^* z - w \Leftrightarrow x - z \sim^* y -$
$w$] $\Rightarrow$ $\prec$ on $X$ as defined by (6.3) is a weak order.

10. Show that $B_2$ of Theorem 6.1 (B with $m = 2$) implies (6.2) and $x - y \sim^* z -$
$w \Leftrightarrow x - z \sim^* y - w$.

11. Prove the following theorem. If $X$ is finite, if $\prec^*$ on $X \times X$ is irreflexive,
and if [$x^1, \ldots, x^{2m}, w^1, \ldots, w^{2m}$ is a permutation of $y^1, \ldots, y^{2m}, z^1, \ldots, z^{2m}$,
$x^j - y^j \sim^* z^j - w^j$ for $j = 1, \ldots, m$, $x^j - y^j \prec^* z^j - w^j$ for $j = m + 1, \ldots,$
$2m - 1$] $\Rightarrow$ not $x^{2m} - y^{2m} \prec^* z^{2m} - w^{2m}$, for all positive integers $m$ and $x^j, y^j, z^j,$
$w^j \in X$, then there is a real-valued function $u$ on $X$ such that

$$x - y \prec^* z - w \Leftrightarrow u(x) - u(y) + 1 < u(z) - u(w), \quad \text{for all } x, y, z, w \in X.$$

12. Interpret $M^1$, $M^2$, and $M^3$ (Section 6.3) in terms of points on a line.

13. Show that B1 and B2 in Section 6.3 imply that if $x - y \sim^* y - y$ and $y -$
$z \prec^* w - t$ then $x - z \prec^* w - t$. (Use Exercises 4 and 5. This exercise is due to
Michael Levine: see Suppes and Zinnes (1963, p. 35).)

14. Show that [(6.8), (6.9), C4, C5, $x \prec y$ for some $x, y \in X$] $\Rightarrow$ $p = q = \tfrac{1}{2}$,
$r = 0$.

15. With $\prec^*$ defined from $\prec$ on $X$ as in (6.10), prove the following.

a. (C1, C2, C3) $\Rightarrow$ $\prec^*$ on $X \times X$ is a weak order. (Due to Luce and Tukey
(1964, p. 14).)

b. (C1, C5) $\Rightarrow$ (6.1) and (6.2).

c. (C1, C4) $\Rightarrow$ $x - s \sim^* s - y$ for some $s \in X$.

d. (C1, C2, C3) $\Rightarrow$ A2 (in the Debreu axioms).

16. Let condition $B_m$ of Theorem 6.1 hold for $m \leq 6$. Assume also that $f(x, y) =$
$z \Rightarrow x - z \sim^* z - y$ and let (6.10) apply. Prove that C3, Pfanzagl’s bisymmetry
axiom, follows.

17. Galanter (1962) asks the following type of question: What amount of money,
as a gift, would make you feel twice as happy as you’d feel if you were to receive a
gift of \$10? If the response is \$45 (the median for one sample), it is suggested that
we set $u(\$45) = 2u(\$10)$, with $u(\$0) = 0$. This is the same as taking $u(\$45) -$
$u(\$10) = u(\$10) - u(\$0)$ so that \$10 is midway in preference between \$0 and \$45.
Do you feel that this midpoint interpretation is reasonable in view of the question
that gave rise to it and the strength-of-preference interpretation used in the chapter?

18. (Continuation.) A motorist is asked for his reaction to delays at toll booths
with the question: What waiting time $r$ would make you twice as mad as you would
be if you had to wait for time $t$? Given the set of $(t, r)$ pairs {(1, 3), (3, 8), (8, 18),
(18, 30), (30, 45), (45, 60), (60, 75), (75, 90)} and taking $u(r) - u(0) = 2[u(t) -$
$u(0)]$ for each of the eight $(t, r)$ pairs, set $u(0) = 0$ and $u(1) = -1$ and sketch $u$ on
[0, 100].


Chapter  7 


PREFERENCES  ON  HOMOGENEOUS 
PRODUCT  SETS 


A homogeneous product set has the form $X = A \times A \times \cdots \times A$. If $A$ is repeated
$n$ times, we write $X = A^n$. A common interpretation for $A^n$ is that
there are $n$ time periods and $(x_1, \ldots, x_n) \in A^n$ represents a series of similar
events that can be selected or occur during the $n$ periods: $x_i$ is the event for
period $i$. $(x_1, \ldots, x_n)$ could be a series of annual incomes for the next $n$
years or, in a single-period context, $x_i$ could be the amount of money
allocated to the $i$th of $n$ activities.

With  X  =  An,  this  chapter  examines  concepts  for  the  time  context, 
including  persistence,  impatience,  and  discounting.  Our  usage  of  these  terms 
is  based  on  the  work  of  Koopmans  (1960),  Koopmans,  Diamond  and 
Williamson  (1964),  and  Diamond  (1965)  in  a  denumerable-period  formula¬ 
tion. 

Throughout, $\prec$ on $X$ will be assumed to be a weak order. Since the independence
notions of Chapters 4 and 5 are relevant for $X = A^n$, we shall
consider, in conjunction with the foregoing concepts, special cases of

$$x \prec y \Leftrightarrow \sum_{i=1}^{n} u_i(x_i) < \sum_{i=1}^{n} u_i(y_i), \quad \text{for all } x, y \in A^n. \tag{7.1}$$

One such case is the no time preference situation where $p$ is a real-valued
function on $A$ and

$$x \prec y \Leftrightarrow \sum_{i=1}^{n} p(x_i) < \sum_{i=1}^{n} p(y_i), \quad \text{for all } x, y \in A^n. \tag{7.2}$$

Given (7.1), it is easily shown that there is a $p$ that satisfies (7.2) if and only
if $(x_1, \ldots, x_n) \sim (y_1, \ldots, y_n)$ whenever $x_1, \ldots, x_n$ is a permutation of
$y_1, \ldots, y_n$. In the time context this says that times of occurrence of various
events have no effect on preferences, which is often false. Somewhat more
realistic special cases of (7.1) will be considered later.
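The permutation property behind (7.2) is easy to see numerically. In the sketch below the event labels and the values of `p` are hypothetical; it shows only the direction that a common $p$ makes every reordering of a stream indifferent to the original.

```python
from itertools import permutations

# Hypothetical single function p on A, as in (7.2).
p = {"low": 1, "mid": 4, "high": 9}

def value(stream):
    """Sum of p over the periods, the quantity compared in (7.2)."""
    return sum(p[event] for event in stream)

stream = ("low", "high", "mid")
values_over_reorderings = {value(perm) for perm in permutations(stream)}

print(values_over_reorderings)
```

All six reorderings receive the same total, so under (7.2) the timing of events cannot matter.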




7.1  PERSISTENCE  AND  IMPATIENCE 

Two notions that postulate forms of regularity of preferences in the
homogeneous time context are persistence and impatience. Persistence applies
when similar preferences hold in the various periods. Impatience says that
you’d rather have more preferred things happen sooner than later. In the
following definitions $\bar{a}$ denotes the constant alternative that yields $a \in A$ in
every time period: $\bar{a} = (a, \ldots, a)$.

Definition 7.1. $\prec$ on $A^n$ is persistent if and only if $(x_1, \ldots, x_{i-1},$
$a, x_{i+1}, \ldots, x_n) \prec (x_1, \ldots, x_{i-1}, b, x_{i+1}, \ldots, x_n) \Rightarrow (y_1, \ldots, y_{j-1}, a,$
$y_{j+1}, \ldots, y_n) \prec (y_1, \ldots, y_{j-1}, b, y_{j+1}, \ldots, y_n)$ whenever $i, j \in \{1, \ldots, n\}$
and all four $n$-tuples are in $A^n$. $\prec$ on $A^n$ is impatient if and only if
$\bar{a} \prec \bar{b} \Rightarrow (x_1, \ldots, x_{i-1}, a, b, x_{i+2}, \ldots, x_n) \prec (x_1, \ldots, x_{i-1}, b, a, x_{i+2}, \ldots, x_n)$
and $\bar{a} \sim \bar{b} \Rightarrow (x_1, \ldots, x_{i-1}, a, b, x_{i+2}, \ldots, x_n) \sim (x_1, \ldots, x_{i-1}, b, a,$
$x_{i+2}, \ldots, x_n)$ whenever $i \in \{1, \ldots, n - 1\}$ and the $n$-tuples are in $A^n$.

Persistence seems reasonable when the $n$-tuples in $X$ represent income
streams over a period of $n$ years. Impatience might also hold in this case.
The reverse of impatience could hold in some situations for people who
prefer to postpone favorable events, perhaps to increase their anticipatory
pleasure or for a variety of other reasons. The reverse of persistence might
arise from a desire for variety, as in the chicken-steak example preceding
Section 4.1.

When $\prec$ is a weak order, $\prec$ is persistent implies that $\prec_i$ on $A$, defined by
$a \prec_i b \Leftrightarrow (x_1, \ldots, x_{i-1}, a, x_{i+1}, \ldots, x_n) \prec (x_1, \ldots, x_{i-1}, b, x_{i+1}, \ldots, x_n)$
for some $x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_n \in A$, is a weak order (which also
follows from condition $C_2$ ($m = 2$), Theorem 4.1) and that the $\prec_i$ are
identical (which does not follow from $C_2$).

In our definition of impatience, $a$ and $b$ are in contiguous time periods. A
more general case of impatience arises when

$$\bar{a} \prec (\sim) \, \bar{b} \Rightarrow (x_1, \ldots, x_{i-1}, a, x_{i+1}, \ldots, x_{j-1}, b, x_{j+1}, \ldots, x_n) \prec (\sim) \, (x_1, \ldots, x_{i-1}, b, x_{i+1}, \ldots, x_{j-1}, a, x_{j+1}, \ldots, x_n) \tag{7.3}$$

for any $1 \leq i < j \leq n$ and $x_1, \ldots, x_n \in A$. This does not follow from
persistence and impatience. The following theorem amplifies this statement.

THEOREM 7.1. ($\prec$ is a persistent and impatient weak order on $A^n$)
does not imply (7.3). ($\prec$ is an impatient weak order on $A^n$ that satisfies
condition $C_2$ of Theorem 4.1) implies (7.3).

Proof. For the latter assertion it suffices to show that the hypotheses
imply that $(a, x_2, \ldots, x_{n-1}, b) \prec (\sim) \, (b, x_2, \ldots, x_{n-1}, a)$ when $\bar{a} \prec (\sim) \, \bar{b}$.
Given $\bar{a} \prec (\sim) \, \bar{b}$, repeated applications of impatience give $(a, b, \ldots, b) \prec$
$(\sim) \, (b, a, b, \ldots, b) \prec (\sim) \, (b, b, a, b, \ldots, b) \prec (\sim) \cdots \prec (\sim) \, (b, \ldots, b, a)$
so that $(a, b, \ldots, b) \prec (\sim) \, (b, \ldots, b, a)$. Since $((a, b, \ldots, b), (b, x_2, \ldots,$
$x_{n-1}, a)) \, E_2 \, ((b, \ldots, b, a), (a, x_2, \ldots, x_{n-1}, b))$, $(a, x_2, \ldots, x_{n-1}, b) \prec$
$(\sim) \, (b, x_2, \ldots, x_{n-1}, a)$ follows from condition C with $m = 2$.

To verify the negative assertion, take $A = \{a, b, c\}$, $n = 3$, and let $\prec$ on
$A^3$ be defined by (7.1) when the $u_i$ on $A$ take the values shown in the following
array.

$$\begin{array}{c|ccc} & u_1 & u_2 & u_3 \\ \hline a & 0 & 0 & 0 \\ b & 10 & 9 & 8 \\ c & 20 & 15 & 12 \end{array}$$

Clearly, $\prec$ is persistent and as we shall note in the parentheses it is
impatient ($8 < 9 < 10$, $12 < 15 < 20$, $10 + 15 < 20 + 9$, $9 + 12 < 15 + 8$)
and in fact satisfies (7.3). In the middle of $\prec$ we find $\cdots \prec (b, a, c) \prec$
$(a, c, b) \prec (b, c, a) \prec (b, b, b) \sim (a, c, c) \prec \cdots$. Let $\prec' = \prec$ except that
we replace $(a, c, b) \prec (b, c, a)$ by $(a, c, b) \sim' (b, c, a)$. With this one change,
persistence and impatience hold also for $\prec'$, but (7.3) fails since $\bar{a} \prec' \bar{b}$. ♦
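The counterexample can be verified by enumeration. The sketch below is one possible check, not the author's procedure: the third column of the array (0, 8, 12) is read off from the inequalities quoted in the proof, and the modified order is encoded, as an assumption of convenience, by giving $(a, c, b)$ and $(b, c, a)$ the common value 24, which lies strictly between their neighbours 22 and 27.

```python
from itertools import product

A, n = "abc", 3
# Period utilities u_1, u_2, u_3 from the array in the proof.
U = [{"a": 0, "b": 10, "c": 20},
     {"a": 0, "b": 9, "c": 15},
     {"a": 0, "b": 8, "c": 12}]

def additive(x):
    return sum(U[i][x[i]] for i in range(n))

def modified(x):
    # Same order, except that (a, c, b) and (b, c, a) are made indifferent.
    return 24 if x in (("a", "c", "b"), ("b", "c", "a")) else additive(x)

def put(x, i, v):
    return x[:i] + (v,) + x[i + 1:]

def is_persistent(val):
    # Whether a beats b in one coordinate must not depend on period or context.
    for a, b in product(A, repeat=2):
        verdicts = {val(put(x, i, a)) < val(put(x, i, b))
                    for i in range(n) for x in product(A, repeat=n)}
        if len(verdicts) > 1:
            return False
    return True

def swap_ok(val, i, j):
    # a_bar < (~) b_bar must give (.., a at i, .., b at j, ..) < (~) the swapped tuple.
    for a, b in product(A, repeat=2):
        for x in product(A, repeat=n):
            lo, hi = put(put(x, i, a), j, b), put(put(x, i, b), j, a)
            ca, cb = val((a,) * n), val((b,) * n)
            if ca < cb and not val(lo) < val(hi):
                return False
            if ca == cb and val(lo) != val(hi):
                return False
    return True

def is_impatient(val):
    return all(swap_ok(val, i, i + 1) for i in range(n - 1))

def satisfies_7_3(val):
    return all(swap_ok(val, i, j) for i in range(n) for j in range(i + 1, n))

for val in (additive, modified):
    print(val.__name__, is_persistent(val), is_impatient(val), satisfies_7_3(val))
```

The additive order passes all three checks, while the modified order remains persistent and impatient but fails (7.3) at periods 1 and 3.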

Additive  Utilities 

When (7.1) holds and $\prec$ is persistent, each $u_i$ function has the same order
on $A$, as illustrated with $A = [0, 1]$ on the left of Figure 7.1. When $\prec$ is
impatient also we get a picture like that on the right of the figure in which
$u_1(b) - u_1(a) > u_2(b) - u_2(a) > u_3(b) - u_3(a)$ whenever $b > a$ (i.e., $\bar{a} \prec \bar{b}$),
which says that the vertical distance between $u_1$ and $u_2$, and between $u_2$ and
$u_3$, increases as $b$ increases.

[Figure 7.1 Additive utilities on $[0, 1]^3$. Left panel: persistence. Right panel: persistence and impatience.]

Additive utilities can of course hold when impatience holds and persistence
fails. For, with $A = \{a, b\}$ and $n = 2$, it is easily seen that $(a, b) \prec (a, a) \prec$
$(b, b) \prec (b, a)$ has a $u_1$ and $u_2$ that satisfy (7.1). Since $\bar{a} \prec \bar{b}$ and $(a, b) \prec$
$(b, a)$, $\prec$ is impatient. However, $\prec$ is not persistent since $(a, b) \prec (b, b)$ and
$(b, b) \prec (b, a)$.
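One choice of period utilities for this two-period example is sketched below. The particular numbers are assumptions; any $u_1$, $u_2$ inducing the same order on the four pairs would serve equally well.

```python
from itertools import product

A = "ab"
# Hypothetical period utilities: period 1 favours b, period 2 favours a.
u1 = {"a": 0, "b": 2}
u2 = {"a": 0, "b": -1}

def val(x):
    return u1[x[0]] + u2[x[1]]

order = sorted(product(A, repeat=2), key=val)

# Impatience for n = 2: s_bar below t_bar must imply (s, t) below (t, s).
impatient = all(val((s, t)) < val((t, s))
                for s, t in product(A, repeat=2)
                if val((s, s)) < val((t, t)))

# With additive utilities the context is irrelevant, so persistence reduces
# to the two periods ranking a and b the same way.
period1_prefers_b = val(("a", "b")) < val(("b", "b"))
period2_prefers_b = val(("b", "a")) < val(("b", "b"))
persistent = period1_prefers_b == period2_prefers_b

print(order, impatient, persistent)
```

The sorted list reproduces the order stated in the text, and the two periods disagree about $a$ versus $b$, which is why persistence fails.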

7.2  PERSISTENT  PREFERENCE  DIFFERENCES 

We shall now look at a higher-order persistence notion based on the degree
of preference relation $\prec^*$ on $X \times X$ used in Chapter 6, along with the
weak-order difference representation

$$x - y \prec^* z - w \Leftrightarrow u(x) - u(y) < u(z) - u(w), \quad \text{for all } x, y, z, w \in A^n. \tag{7.4}$$

As in Definition 7.1, $\bar{a} = (a, \ldots, a)$ in the following.

Definition 7.2. $\prec^*$ on $A^n \times A^n$ is persistent if and only if

$$x - y \prec^* z - w \Leftrightarrow \bar{x}_j - \bar{y}_j \prec^* \bar{z}_j - \bar{w}_j \tag{7.5}$$

whenever $j \in \{1, \ldots, n\}$, $x_i = y_i$ and $z_i = w_i$ for all $i \neq j$, and $x, y, z, w \in A^n$.

This says that the order of preference differences with constant alternatives
dictates the order of differences for each $j$, other things being equal. With
$n = 2$, $\prec^*$ is persistent implies that if $(a, x_2) - (b, x_2) \prec^* (c, y_2) - (d, y_2)$
for some $x_2, y_2 \in A$ then this holds for every $x_2, y_2 \in A$ and, in addition,
$(x_1, a) - (x_1, b) \prec^* (y_1, c) - (y_1, d)$ holds for every $x_1, y_1 \in A$.

Part of the power of persistent $\prec^*$ is shown by the next theorem.

THEOREM 7.2. If $u$ on $A^n$ satisfies (7.4) and if $\prec^*$ is persistent then there
are real-valued functions $u_1, \ldots, u_n$ on $A$ for which

$$u(x_1, \ldots, x_n) = \sum_{i=1}^{n} u_i(x_i), \quad \text{for all } x \in A^n, \tag{7.6}$$

and, for every $a, b, c, d \in A$ and $i, j \in \{1, \ldots, n\}$,

$$u_i(a) - u_i(b) < u_i(c) - u_i(d) \Leftrightarrow u_j(a) - u_j(b) < u_j(c) - u_j(d). \tag{7.7}$$

When $\prec$ is defined as in (6.3), (7.1) follows immediately from (7.6) and
(7.4). Hence, additive utilities exist for $A^n$ when (7.4) holds and $\prec^*$ is
persistent.




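Proof. Let $u$ satisfy (7.4) and assume that $\prec^*$ is persistent. Fix $e \in A$,
assign $u_1(e), \ldots, u_n(e)$ so that $u(\bar{e}) = \sum_{i=1}^{n} u_i(e)$, and define $u_i$ on $A$, for
$i = 1, \ldots, n$, by

$$u_i(a) = u(e, \ldots, e, a, e, \ldots, e) - \sum_{j \neq i} u_j(e), \quad \text{for all } a \in A, \tag{7.8}$$

where $a$ occupies position $i$. To verify (7.6), let $\alpha^i = (x_1, \ldots, x_i, e, \ldots, e)$,
$\beta^i = (x_1, \ldots, x_{i-1}, e, \ldots, e)$, and $\gamma^i = (e, \ldots, e, x_i, e, \ldots, e)$, for $2 \leq i \leq n$.
If $\alpha^i - \beta^i \prec^* \gamma^i - \bar{e}$ then $\bar{x}_i - \bar{e} \prec^* \bar{x}_i - \bar{e}$ by (7.5), and similarly if
$\gamma^i - \bar{e} \prec^* \alpha^i - \beta^i$. Hence $\alpha^i - \beta^i \sim^* \gamma^i - \bar{e}$ so that, by (7.4),

$$u(x_1, \ldots, x_i, e, \ldots, e) - u(x_1, \ldots, x_{i-1}, e, \ldots, e) = u(e, \ldots, e, x_i, e, \ldots, e) - u(\bar{e}).$$

Summing from $i = 2$ to $i = n$ and using $u(\bar{e}) = \sum_i u_i(e)$ and (7.8) we get

$$u(x_1, \ldots, x_n) - u(x_1, e, \ldots, e) = \sum_{i=2}^{n} u_i(x_i) - \sum_{i=2}^{n} u_i(e),$$

which yields (7.6) after $u(x_1, e, \ldots, e)$ is transposed and (7.8) is used again.
(7.7) follows easily from (7.4), (7.5) and (7.6). ♦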
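The bookkeeping in (7.8) can be traced on a small case. The sketch below starts, as an assumption, from a $u$ that is already additive (a constant plus hypothetical period values), so it illustrates how the $u_i$ are recovered rather than proving the theorem; the reference element $e$ and the equal split of $u(\bar{e})$ are arbitrary choices.

```python
from fractions import Fraction
from itertools import product

A, n = "abc", 3
# Hypothetical data: u is a constant plus period-specific values.
T = [{"a": 0, "b": 1, "c": 3},
     {"a": 0, "b": 2, "c": 5},
     {"a": 0, "b": 3, "c": 9}]

def u(x):
    return 5 + sum(T[i][x[i]] for i in range(n))

e = "a"                                  # fixed reference element
e_bar = (e,) * n
# Any assignment of u_1(e), ..., u_n(e) summing to u(e_bar) is allowed.
u_at_e = [Fraction(u(e_bar), n)] * n

def u_i(i, a):
    """Definition (7.8): vary coordinate i only, subtract the other u_j(e)."""
    x = e_bar[:i] + (a,) + e_bar[i + 1:]
    return u(x) - sum(u_at_e[j] for j in range(n) if j != i)

# (7.6): u(x) equals the sum of the recovered u_i(x_i) for every x.
is_additive = all(u(x) == sum(u_i(i, x[i]) for i in range(n))
                  for x in product(A, repeat=n))

print(is_additive, [u_i(i, e) for i in range(n)])
```

The recovered functions agree with the assigned values at $e$, which is the consistency that the proof relies on when (7.8) is "used again".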

Weighted  Additivity 

In the rest of this section we shall consider a form of weighted, additive
utilities that is less general than (7.1) and more general than (7.2). This is
the form

$$x \prec y \Leftrightarrow \sum_{i=1}^{n} \lambda_i p(x_i) < \sum_{i=1}^{n} \lambda_i p(y_i), \quad \text{for all } x, y \in A^n, \tag{7.9}$$

where $\lambda_i > 0$ for each $i$ and $p$ is a real-valued function on $A$. It is easily seen
that, when (7.1) holds, (7.9) can hold if and only if there are $u_i$ satisfying
(7.1) that are pairwise related by positive linear transformations, say with
$u_j = a_j u_1 + b_j$ and $a_j > 0$ for $j = 2, \ldots, n$.

In the time context the $\lambda_i$ are weights for the different periods. If $\lambda_1 >$
$\lambda_2 > \cdots > \lambda_n$, we could call them discount factors: $\lambda_1 > \cdots > \lambda_n$ follows
from (7.9) when $\prec$ is impatient and $x \prec y$ for some $x, y \in X$. If $\lambda_1 < \cdots < \lambda_n$,
the $\lambda_i$ might be referred to as markup factors.

In general, (7.1) along with $\prec$ persistent is insufficient for (7.9). As this
is written, I do not know of any set of axioms for $\prec$ on $A^n$ that, even when $A$
is finite, is necessary and sufficient for (7.9). For this reason, and because
(7.9) implies (7.7) when $u_i = \lambda_i p$, we shall consider a pathway to (7.9) that
leads through (7.4) and makes the assumption that $\prec^*$ is persistent. Even
here we shall note a negative conclusion before giving sufficient conditions
for (7.9).




THEOREM 7.3. Suppose (7.4) holds and $\prec^*$ is persistent. Then with $\prec$
defined by $x \prec y \Leftrightarrow x - x \prec^* y - x$, there may not exist $\lambda_i > 0$ and a
real-valued function $p$ on $A$ that satisfy (7.9). This conclusion holds even when
$u$ in (7.4) is unique up to a positive linear transformation.

Proof. Let $A = \{a, b, c\}$ and $n = 3$, and let (7.1) hold with the $u_i$ defined
as follows:

$$\begin{array}{c|ccc} & a & b & c \\ \hline u_1 & 0 & 1 & 3 \\ u_2 & 0 & 2 & 5 \\ u_3 & 0 & 3 & 9 \end{array}$$

Define $u$ by (7.6) and take $x - y \prec^* z - w \Leftrightarrow u(x) - u(y) < u(z) - u(w)$.
Because $u(\bar{b}) - u(\bar{a}) < u(\bar{c}) - u(\bar{b})$ and $u_i(b) - u_i(a) < u_i(c) - u_i(b)$ for
$i = 1, 2, 3$, $\prec^*$ is persistent. In defining $\lambda_1, \lambda_2, \lambda_3$ and $p$ for (7.9) we can,
with no loss in generality, set $\lambda_1 = 1$, $p(a) = 0$ and $p(b) = 1$. Then, since
$(c, a, a) \sim (a, a, b) \sim (b, b, a)$, $p(c) = \lambda_3 = \lambda_2 + 1$. This along with
$(a, a, c) \sim (b, c, b)$ gives $(\lambda_2 + 1)^2 = 1 + \lambda_2(\lambda_2 + 1) + (\lambda_2 + 1)$ according
to (7.9), and this reduces to $1 = 2$, which is false. Hence (7.9) cannot hold.
Moreover, $u$ is unique up to a positive linear transformation when it satisfies
(7.4). This follows from the fact that each of the 25 other $u(x_1, x_2, x_3)$ can be
written solely in terms of $u(a, a, a)$ and $u(b, a, a)$ when (7.4) holds. ♦
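The arithmetic of this proof can be replayed mechanically. The sketch below is a check under the proof's own normalization ($\lambda_1 = 1$, $p(a) = 0$, $p(b) = 1$); the helper names and the trial values of $\lambda_2$ are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product

A = "abc"
U = [{"a": 0, "b": 1, "c": 3},
     {"a": 0, "b": 2, "c": 5},
     {"a": 0, "b": 3, "c": 9}]

def u(x):
    return sum(U[i][x[i]] for i in range(3))

def bar(a):
    return (a,) * 3

# Persistence (7.5): because u is additive, a difference between tuples that
# agree off coordinate j reduces to u_j(a) - u_j(b).
persistent = all(
    (U[j][a] - U[j][b] < U[j][c] - U[j][d])
    == (u(bar(a)) - u(bar(b)) < u(bar(c)) - u(bar(d)))
    for j in range(3) for a, b, c, d in product(A, repeat=4))

# The indifferences used in the proof.
first_tie = u(("c", "a", "a")) == u(("a", "a", "b")) == u(("b", "b", "a"))
second_tie = u(("a", "a", "c")) == u(("b", "c", "b"))

def residual(lam2):
    """(7.9) value of (a, a, c) minus that of (b, c, b), given the first tie."""
    p_c = lam3 = lam2 + 1             # forced by (c,a,a) ~ (a,a,b) ~ (b,b,a)
    value_aac = lam3 * p_c
    value_bcb = 1 * 1 + lam2 * p_c + lam3 * 1
    return value_aac - value_bcb

gaps = {residual(l) for l in (Fraction(1, 3), 1, 2, 10)}
print(persistent, first_tie, second_tie, gaps)
```

Whatever positive $\lambda_2$ is tried, the two sides of the second indifference differ by exactly 1, which is the "$1 = 2$" contradiction in the text.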

Sufficient  Conditions  for  Weighted  Additivity 

Despite Theorem 7.3 there are axioms implying (7.4) which imply (7.9)
also when \(\prec^*\) is persistent. We consider one such case, based on Debreu's
theory. The following correspond to A1-A4 in Section 6.3. \(X = A^n\).

A1'. \(x - y \prec^* z - w \Rightarrow w - z \prec^* y - x\),

A2'. If \(x^1, x^2, x^3\) is a permutation of \(z^1, z^2, z^3\), and \(y^1, y^2, y^3\) is a permutation
of \(w^1, w^2, w^3\), and if \(x^j - y^j \prec^* z^j - w^j\) or \(x^j - y^j \sim^* z^j - w^j\) for \(j = 1, 2\),
then not \(x^3 - y^3 \prec^* z^3 - w^3\),

A3'. \((A, \mathcal{T})\) is a connected and separable topological space,

A4'. \(\{x - y : x - y \in X \times X,\ x - y \prec^* z - w\} \in \mathcal{T}^{2n}\) and
\(\{x - y : x - y \in X \times X,\ z - w \prec^* x - y\} \in \mathcal{T}^{2n}\) for every \(z - w \in X \times X\).

\(\mathcal{T}^{2n}\) is the product topology for \(X \times X = A^n \times A^n\). By Lemma 5.3, A3'
says that \((A^n, \mathcal{T}^n)\) and \((X \times X, \mathcal{T}^{2n})\) are connected and separable topological
spaces. It then follows from Theorem 5.4 and A1' that there is a continuous
(in \(\mathcal{T}^n\)) real-valued function \(u\) on \(X\) that satisfies (7.4) and is unique up to a
positive linear transformation.

Let \(u_i\) on \(A\) be defined by (7.8) in the proof of Theorem 7.2. Since \(u\) is
continuous in \(\mathcal{T}^n\), \(u_i\) is continuous in \(\mathcal{T}\) for each \(i\). Let \(\prec^*\) on \(X \times X\) be



persistent and define \(\prec^*\) on \(A \times A\) by

\[ a - b \prec^* c - d \iff \bar{a} - \bar{b} \prec^* \bar{c} - \bar{d}, \quad \text{for all } a, b, c, d \in A. \]

It then follows from Theorem 7.2 and persistence that

\[ a - b \prec^* c - d \iff u_i(a) - u_i(b) < u_i(c) - u_i(d), \quad \text{for all } i. \tag{7.10} \]

This is (7.4) in miniature, for \(A\) instead of \(X\). Since \(u_i\) is continuous and (7.10)
holds for each \(i\), the correspondents of A1'-A4' hold for \(\prec^*\) on \(A^1\) for each \(i\).
It follows that \(u_1\) and \(u_j\) are related by a positive linear transformation. In
particular there are positive \(\alpha_2, \ldots, \alpha_n\), and \(\beta_2, \ldots, \beta_n\) such that
\(u_j(a) = \alpha_j u_1(a) + \beta_j\) for all \(a \in A\), \(j = 2, \ldots, n\). Letting \(\rho = u_1\), \(\lambda_1 = 1\),
\(\lambda_j = \alpha_j\) for \(j > 1\), (7.6) gives \(u(x) = \sum_i \lambda_i \rho(x_i) + \text{constant}\), which, on using
(6.3) and (7.4) gives \(x \prec y \iff \sum_i \lambda_i \rho(x_i) < \sum_i \lambda_i \rho(y_i)\) with all \(\lambda_i > 0\).
This proves the first part of the following theorem.

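THEOREM 7.4. Suppose \(X = A^n\), \(\prec^*\) on \(X \times X\) is persistent, and A1',
A2', A3', and A4' hold. Then there are \(\lambda_i > 0\) and a continuous (in \(\mathcal{T}\))
real-valued function \(\rho\) on \(A\) that satisfy (7.9) when \(\prec\) on \(A^n\) is defined from
\(\prec^*\) on \(A^n \times A^n\) by \(x \prec y \iff x - x \prec^* y - x\). If in addition \(n > 1\) and
\(x \prec y\) for some \(x, y \in X\) and if \(\lambda_i' > 0\) and \(\rho'\) on \(A\) satisfy (7.9) along with
\(\lambda_i > 0\) and \(\rho\) on \(A\), then there are numbers \(\alpha > 0\), \(\beta > 0\) and \(\gamma\) such that

\[ \lambda_i' = \alpha\lambda_i, \quad i = 1, \ldots, n \tag{7.11} \]

\[ \rho'(a) = \beta\rho(a) + \gamma \quad \text{for all } a \in A. \tag{7.12} \]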

Proof. For the uniqueness assertions take \(\rho\) and the \(\lambda_i\) as defined for the
first part. Let \(u_i(a) = \lambda_i\rho(a)\) and \(u(x) = \sum_i u_i(x_i)\). Then, as in the first part
of the proof, \(u\) is continuous and hence, by Theorem 3.5, \(\{x : x \prec y\} \in \mathcal{T}^n\)
and \(\{x : y \prec x\} \in \mathcal{T}^n\), which establish condition Q3 of Theorem 5.4 (\(n = 2\))
or condition Q3* of Theorem 5.5 (\(n > 2\)). With \(\prec^*\) persistent and \(x \prec y\)
for some \(x, y \in X\), each of the \(n\) factors has an active influence on \(\prec\). Since
the other conditions of Theorem 5.4 or Theorem 5.5 are easily seen to hold
for \(\prec\) on \(A^n\), by (7.9), it follows that the \(\lambda_i\rho\) in (7.9) are unique up to similar
positive linear transformations. Hence \(\lambda_i' > 0\) and \(\rho'\) satisfy (7.9) if and only
if there is a \(k > 0\) and \(\beta_i\) such that \(\lambda_i'\rho'(a) = k\lambda_i\rho(a) + \beta_i\) for \(i = 1, \ldots, n\).
Since this gives \(\rho'(a) = (k\lambda_i/\lambda_i')\rho(a) + \beta_i/\lambda_i'\), \(\rho'\) is a positive linear
transformation of \(\rho\) as in (7.12). Also, since \(\rho\) is not constant on \(A\) (\(x \prec y\) for
some \(x, y\)), \(k\lambda_i/\lambda_i' = k\lambda_j/\lambda_j'\) for all \(i, j\), or \(\lambda_j' = (\lambda_1'/\lambda_1)\lambda_j\) for \(j = 2, \ldots, n\).
Set \(\alpha = \lambda_1'/\lambda_1\). (7.11) then follows. ♦

7.3  CONSTANT  DISCOUNT  RATES 

Although persistent preference differences were used to obtain (7.9) for
arbitrary positive \(\lambda_i\), special cases of (7.9) can be derived using only the
simple preference relation \(\prec\). One of these is (7.2). Another occurs when
\(\lambda_{i+1} = \pi\lambda_i\) for \(i = 1, 2, \ldots, n - 1\) with \(\pi > 0\), in which case (7.9) reduces to

\[ x \prec y \iff \sum_{i=1}^{n} \pi^{i-1}\rho(x_i) < \sum_{i=1}^{n} \pi^{i-1}\rho(y_i), \quad \text{for all } x, y \in A^n. \tag{7.13} \]

If \(\pi = 1\) we have (7.2), the case of no time preference. If \(\pi < 1\), (7.13)
represents the case where utilities are discounted at a constant rate which

< is  ^ — -  >  >■  «—  «  £3 

One way to obtain (7.13) is to begin with Debreu's additivity theory.
With \(n \geq 3\) we shall use the hypotheses of Theorem 5.5 applied to
\(X = A^n\), along with one more condition. The new condition
is referred to as temporal consistency by Williams and Nassar (1966)
and as stationarity by Koopmans (1960).

Definition 7.3. \(\prec\) on \(A^n\) is stationary if and only if there is an \(e \in A\) such
that, for all \(x_1, \ldots, x_{n-1}, y_1, \ldots, y_{n-1} \in A\),

\[ (x_1, \ldots, x_{n-1}, e) \prec (y_1, \ldots, y_{n-1}, e) \iff (e, x_1, \ldots, x_{n-1}) \prec (e, y_1, \ldots, y_{n-1}). \tag{7.14} \]

In going from \((x_1, \ldots, x_{n-1}, e)\) to \((e, x_1, \ldots, x_{n-1})\) each \(x_i\) is updated by
one period and \(e\) is shifted from the last period to the first. Stationarity says
that preferences do not change under such shifts.

THEOREM 7.5. If the hypotheses of Theorem 5.5 hold for \(X = A^n\) and if
\(\prec\) on \(A^n\) is stationary then there is a positive number \(\pi\) and a continuous
real-valued function \(\rho\) on \(A\) that satisfy (7.13). Moreover, \(\pi\) is unique and \(\rho\) is
unique up to a positive linear transformation.

Proof. Let the hypotheses hold, with continuous \(u_i\) for (7.1) unique up to
similar positive linear transformations. Define \(\prec'\) on \(A^{n-1}\) by

\[ (x_1, \ldots, x_{n-1}) \prec' (y_1, \ldots, y_{n-1}) \iff (x_1, \ldots, x_{n-1}, e) \prec (y_1, \ldots, y_{n-1}, e). \]

It follows from (7.1) and (7.14) that, for all \((x_1, \ldots, x_{n-1}), (y_1, \ldots, y_{n-1}) \in A^{n-1}\),

\[ (x_1, \ldots, x_{n-1}) \prec' (y_1, \ldots, y_{n-1}) \iff \sum_{i=1}^{n-1} u_i(x_i) < \sum_{i=1}^{n-1} u_i(y_i) \]

\[ (x_1, \ldots, x_{n-1}) \prec' (y_1, \ldots, y_{n-1}) \iff \sum_{i=1}^{n-1} u_{i+1}(x_i) < \sum_{i=1}^{n-1} u_{i+1}(y_i). \]

It follows from these two expressions and Theorems 5.4 and 5.5 that there is
a \(\pi > 0\) and numbers \(\beta_1, \ldots, \beta_{n-1}\) such that

\[ u_{i+1}(a) = \pi u_i(a) + \beta_i \quad \text{for all } a \in A;\ i = 1, \ldots, n - 1. \]

Using this recursively to express each \(u_i\) in terms of \(u_1\) and letting \(\rho = u_1\),
substitution into (7.1) yields (7.13).

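Suppose (7.13) holds as given, along with
\(x \prec y \iff \sum_i \lambda^{i-1}\rho'(x_i) < \sum_i \lambda^{i-1}\rho'(y_i)\).
From Debreu's uniqueness up to similar positive linear transformations it
follows that there are numbers \(\alpha > 0\) and \(\beta_1, \ldots, \beta_n\) such that

\[ \lambda^{i-1}\rho'(a) = \alpha\pi^{i-1}\rho(a) + \beta_i \quad \text{for all } a \in A;\ i = 1, \ldots, n. \]

With \(i = 1\) this gives \(\rho'(a) = \alpha\rho(a) + \beta_1\). Substituting for \(\rho'\) with \(i > 1\) we
then have \(\lambda^{i-1}[\alpha\rho(a) + \beta_1] = \alpha\pi^{i-1}\rho(a) + \beta_i\) which, since \(\rho\) is not constant
on \(A\), requires \(\lambda = \pi\). ♦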
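
The role of the geometric weights \(\pi^{i-1}\) in (7.13) can be seen on a small finite example. The sketch below is an illustration under stated assumptions: the set `A`, the values of `rho`, the choice \(\pi = 9/10\), and the function names are all hypothetical, and a finite \(A\) does not satisfy the topological hypotheses of Theorem 7.5. It only checks (7.14) by brute force for a weighted additive form, once with geometric weights and once with weights that are not geometric.

```python
from fractions import Fraction
from itertools import product

def weighted_utility(x, rho, weights):
    """Weighted additive form: sum of weights[i] * rho(x_i) over the periods."""
    return sum(w * rho[a] for w, a in zip(weights, x))

def is_stationary(A, rho, weights, e):
    """Brute-force check of (7.14) for a fixed reference element e."""
    n = len(weights)
    for x in product(A, repeat=n - 1):
        for y in product(A, repeat=n - 1):
            # e in the last period
            last = (weighted_utility(x + (e,), rho, weights)
                    < weighted_utility(y + (e,), rho, weights))
            # e shifted to the first period
            first = (weighted_utility((e,) + x, rho, weights)
                     < weighted_utility((e,) + y, rho, weights))
            if last != first:
                return False
    return True

# Hypothetical data: three consequences and three periods.
A = ("a", "b", "c")
rho = {"a": Fraction(0), "b": Fraction(1), "c": Fraction(3)}
pi = Fraction(9, 10)

geometric = [pi ** i for i in range(3)]             # 1, pi, pi^2 as in (7.13)
uneven = [Fraction(1), Fraction(3), Fraction(1)]    # positive but not geometric

print(is_stationary(A, rho, geometric, "a"), is_stationary(A, rho, uneven, "a"))
```

With geometric weights, shifting every component one period multiplies each utility difference by the same factor \(\pi\), so the comparison is unchanged. With the uneven weights the pair \((a, b)\), \((b, a)\) already breaks (7.14).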

7.4  SUMMARY 

When \(X = A^n\) and \(i\) indexes time, new concepts come into play, including
no time preference, impatience, persistent preferences, persistent preference
differences, and stationarity. These concepts can apply whether or not
utilities are additive over the \(n\) periods.

The most general special case of additivity considered in this chapter is the
weighted form with \(\lambda_i > 0\) for each \(i\). Debreu's topological theory for weak
ordered preference differences along with persistent preference differences
implies this form. Additive utilities, but not necessarily the weighted form
given here, arise from the representation
\(x - y \prec^* z - w \iff u(x) - u(y) < u(z) - u(w)\) along with persistent
preference differences.

Under appropriately strong axioms for additive utilities based on simple
preference comparisons, the form
\(x \prec y \iff \sum_i \pi^{i-1}\rho(x_i) < \sum_i \pi^{i-1}\rho(y_i)\) can
result when \(\prec\) is assumed to be stationary. If \(\prec\) is impatient also then
\(0 < \pi < 1\).


INDEX  TO  EXERCISES 

1-3. No time preference. 4-5. Persistent preferences. 6. Impatience. 7. Persistent
differences. 8. Nonhomogeneous preference difference additivity. 9-10. Weighted additivity.
11-12. Constant discount rate. 13-14. Present monetary value.


Exercises 

1. Given (7.1) prove that (7.2) follows when \((x_1, \ldots, x_n) \sim (y_1, \ldots, y_n)\)
whenever \(x_1, \ldots, x_n\) is a permutation of \(y_1, \ldots, y_n\). Define \(\rho\) by \(\rho(a) = \sum_i u_i(a)\).

2. With \(X = A^n\), let \((x^1, \ldots, x^m)\, E^*\, (y^1, \ldots, y^m) \iff\) [\(m > 1\); \(x^1, \ldots, x^m\),
\(y^1, \ldots, y^m \in X\); the number of times \(a \in A\) appears as a component in \((x^1, \ldots, x^m)\)
equals the number of times it appears as a component in \((y^1, \ldots, y^m)\), for each
\(a \in A\)]. Let condition \(C'\) be: [\((x^1, \ldots, x^m)\, E^*\, (y^1, \ldots, y^m)\), \(x^j \prec y^j\) or \(x^j \sim y^j\)
for \(j = 1, \ldots, m - 1\)] \(\Rightarrow\) not \(x^m \prec y^m\). Show that \(C' \Rightarrow C\) of Theorem 4.1 and
that \(C' \Rightarrow\) if \(x, y \in X\) and \(x_1, \ldots, x_n\) is a permutation of \(y_1, \ldots, y_n\) then
\((x_1, \ldots, x_n) \sim (y_1, \ldots, y_n)\).

3. With \(X = A \times A\) suppose \(u(a, b) = u(b, a)\) for all \(a, b \in A\) and that \(x \prec y \iff
u(x) < u(y)\). With \(A = \mathrm{Re}\), specify a \(u\) that satisfies these conditions (define \(\prec\)
from \(<\)) and for which there is no corresponding additive representation as in (7.1).

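4. With \(X = A^n\) suppose \(\prec\) on \(A^n\) is a persistent weak order. Define \(\prec^0\) on \(A\)
by \(a \prec^0 b \iff (x_1, \ldots, x_{i-1}, a, x_{i+1}, \ldots, x_n) \prec (x_1, \ldots, x_{i-1}, b, x_{i+1}, \ldots, x_n)\) for
some \(x_j \in A\). Prove

a. \(\prec^0\) on \(A\) is a weak order,

b. (\(x_i \prec^0 y_i\) or \(x_i \sim^0 y_i\) for \(i = 1, \ldots, n\)) \(\Rightarrow x \preceq y\),

c. (\(x_i \preceq^0 y_i\) for all \(i\) and \(x_i \prec^0 y_i\) for some \(i\)) \(\Rightarrow x \prec y\).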

5. Suppose \(\prec\) on \(A^n\) is a strict partial order, \(\prec\) is persistent, and \(\prec_i\) on \(A\) is
defined as in the paragraph preceding (7.3). Prove that each \(\prec_i\) is a strict partial
order and all \(\prec_i\) are identical. Show also that when \(\prec_i\) is defined in this way and
\(\prec\) is persistent then it is possible to have all \(\prec_i\) identical weak orders on \(A\) when
\(\prec\) on \(A^n\) is not even a strict partial order.

6. Show that \(u_1(b) - u_1(a) > u_2(b) - u_2(a) > \cdots > u_n(b) - u_n(a)\) when \(a \prec b\),
(7.1) holds, and \(\prec\) is impatient.

7. Show that if \(X = A^n\), \(\prec^*\) on \(X \times X\) is persistent, and \(x \prec y \iff x - x \prec^*
y - x\), then \(\prec\) on \(X\) is persistent.

8. Show that if \(X = \prod_{i=1}^{n} X_i\), if (7.4) holds for all \(x, y, z, w \in X\) and if \(x - x' \prec^*
y - y' \Rightarrow z - z' \prec^* w - w'\) whenever \(i \in \{1, \ldots, n\}\), (\(x_j = x_j'\), \(y_j = y_j'\), \(z_j = z_j'\),
\(w_j = w_j'\)) for all \(j \neq i\) and \((x_i, x_i', y_i, y_i') = (z_i, z_i', w_i, w_i')\) then there are real-valued
\(u_i\) on \(X_i\) that satisfy \(u(x) = \sum_i u_i(x_i)\) for all \(x \in X\).

9. Show that (7.9) holds with the \(\lambda_i > 0\) if and only if there are \(u_i\) satisfying
(7.1) that are positive linear transformations of each other.

10. Verify the linear transformation assertion in the proof of Theorem 7.3.

11. Show that if (7.13) holds with \(\pi > 0\) and if \(\prec\) is impatient and \(x \prec y\) for
some \(x, y \in A^n\), then \(\pi < 1\).

12. Under the hypotheses of Theorem 7.5 does (7.14) hold for every \(e \in A\)?

13. Williams and Nassar (1966). Let \(H\) be the following set of hypotheses:
\(X = \mathrm{Re}^n\), conditions 1, 2, and 3 of Theorem 3.3, and \(x \prec y \iff 0 \prec y - x\), for
all \(x, y \in X\). The final assumption is referred to as "marginal consistency." Show that
the following hold, given \(H\).

a. \(x \sim y \iff x - y \sim 0\).

b. \(x \sim y \iff -x \sim -y\).

c. \(x \sim y \iff x + z \sim y + z\) for every \(z \in \mathrm{Re}^n\).

d. \((x \sim y, z \sim w) \Rightarrow x + z \sim y + w\).

e. \(x \sim y \Rightarrow Mx \sim My\) for every integer \(M\).

f. \(x \prec y \iff -y \prec -x\).

g. \(x \prec y \iff x + z \prec y + z\) for every \(z \in \mathrm{Re}^n\).

h. \((x \prec y, z \prec w) \Rightarrow x + z \prec y + w\).

i. \(x \prec y \Rightarrow Mx \prec My\) for every positive integer \(M\), and \(x \prec y \Rightarrow My \prec Mx\)
for every negative integer \(M\).

j. If \(M\) is a nonzero integer then \(Mx \sim My \Rightarrow x \sim y\).

k. \(x \sim y \Rightarrow \alpha x \sim \alpha y\) for every rational number \(\alpha\).

m. \(x \sim y \Rightarrow \alpha x \sim \alpha y\) for every \(\alpha \in \mathrm{Re}\).

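14. (Continuation.) Show that \(H\) implies that there are positive numbers \(\lambda_1, \ldots, \lambda_n\)
such that

\[ x \prec y \iff \sum_{i=1}^{n} \lambda_i x_i < \sum_{i=1}^{n} \lambda_i y_i, \quad \text{for all } x, y \in X. \tag{7.15} \]

To do this show first that, for each \(x \in \mathrm{Re}^n\), there is one and only one \(\alpha \in \mathrm{Re}\) for
which \(x \sim \alpha\). Then take \(u(x) = \alpha\) when \(x \sim \alpha\), so that \(u\) satisfies \(x \prec y \iff u(x) <
u(y)\). Finally, use results d and m of the preceding exercise to show that \(u\) can be
written as \(u(x) = \sum_i \lambda_i x_i\) where \(\lambda_i \sim (0, \ldots, 0, 1, 0, \ldots, 0)\).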




PART II

EXPECTED-UTILITY THEORY


Until the mid-twentieth century, utility theory focused on preference
structures that do not explicitly incorporate uncertainty or probability, the
yardstick for uncertainty. The expected-utility theory of John von Neumann
and Oskar Morgenstern, and an earlier theory by Frank P. Ramsey,
stimulated new interest in the role of uncertainty in preference structures.

An expected-utility theory may incorporate probabilities in the alternatives
of the preference structure or it may formulate uncertainty in the alternatives
without its prior encoding in terms of probability. In the latter case,
probabilities as well as utilities are derived from the axioms. In the former case only
utilities are derived from the axioms since the probabilities are already part
of the axiomatic structure. The former approach is used in this part of the
book: the alternatives are probability measures defined on a set of
consequences. Basic theory is in Chapters 8, 9, and 10; additive expected-utility
theory for multiple-factor situations is in Chapter 11.


Chapter  8 


EXPECTED  UTILITY  WITH  SIMPLE 
PROBABILITY  MEASURES 


When each strategy or decision alternative corresponds to a simple
probability measure on the consequences in a set X, we consider the
expected-utility model for computing utilities of the strategies, or their associated
measures. The idea for this model dates at least from Bernoulli (1738) but it
was not until the present century that apparently reasonable preference
axioms were given as a basis for the model. The axioms of this chapter are
similar to those initiated by von Neumann and Morgenstern (1947) and to
later modifications by Friedman and Savage (1948, 1952), Marschak (1950),
Herstein and Milnor (1953), Cramer (1956), Luce and Raiffa (1957), and
Blackwell and Girshick (1954). The last of these applies to probability
measures that are more general than those considered in this chapter. They
will be examined in Chapter 10.

After  an  introductory  example  and  a  brief  discussion  of  simple  probability 
measures  we  shall  consider  the  basic  theorem  and  then  offer  some  criticisms 
of  its  preference  conditions.  A  complete  proof  of  the  basic  weak-order 
theorem  is  given  in  Section  8.4.  The  case  of  intransitive  indifference  is 
investigated  in  the  next  chapter. 

8.1  EXAMPLE 

Suppose that the owner of a small construction firm plans to submit a
sealed bid for a job that he estimates will cost his company $200000 to
complete. If he bids $x and gets the job, he will be paid $x: his profit is
$x - $200000.

Since the construction industry is in a slump, he believes that there will be
many bids. From his prior experience and knowledge of the current situation
he estimates the probability p(x) of getting the job if he bids $x. [Winkler
(1967a, 1967b) discusses some ways of doing this.] p(x) for 190000 ≤
x ≤ 300000 is shown in Figure 8.1.

Figure 8.1 Probability of getting job for a bid of $x.

Because of the scarcity of work the owner would be willing to take the job
at a loss of not more than $10000. In other words, (get job and make
-$10000) ~ (don't get job). Using an appropriate method of scaling utilities
for the expected-utility model [see, for example, Pratt, Raiffa, and Schlaifer
(1964), Swalm (1966), or Fishburn (1967)], the owner estimates his utility
function for net profit (assuming he gets the job) as shown in Figure 8.2. The
figure indicates that he is indifferent between making $10000 with certainty
and a 50-50 gamble giving either -$10000 or $100000. He is indifferent also
between making $50000 with certainty and an 80-20 gamble giving $100000
(with probability .8) or -$10000 (with probability .2). According to the
expected-utility model, the latter indifference comparison transforms into
u($50000) = .8u($100000) + .2u(-$10000). Equations such as this can be
used as a guide in constructing and checking u.

If he bids $x his expected utility will be p(x)u(get the job and make
$x - $200000 net profit) + [1 - p(x)]u(don't get job). By Figure 8.2 and
(get job and make -$10000) ~ (don't get job), u(don't get job) = 0 so
that

u(bid $x) = p(x)u(get job and make $x - $200000 net profit).

Reading off approximate values for p(x) and u($x - $200000) from Figures
8.1 and 8.2 we obtain the expected-utility curve in Figure 8.3, which shows
that expected utility is maximized at about x = 206000. A bid of about
$206000 is therefore recommended.
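
The calculation just described can be sketched in a few lines of Python. The curves of Figures 8.1 and 8.2 are not reproduced in the text, so the sketch rests on loud assumptions: the utility curve is a piecewise-linear interpolation through the four points implied by the owner's indifference statements, scaled so that u(-$10000) = 0 and u($100000) = 1, and the win-probability curve `WIN_PROB` is a purely hypothetical straight line. The maximizing bid it finds therefore illustrates the method only and is not expected to match the $206000 read from the original figures.

```python
def interpolate(points, t):
    """Piecewise-linear interpolation through sorted (t, value) points."""
    for (t0, v0), (t1, v1) in zip(points, points[1:]):
        if t0 <= t <= t1:
            return v0 + (v1 - v0) * (t - t0) / (t1 - t0)
    raise ValueError("t lies outside the tabulated range")

# Utility of net profit. The points follow from the indifference statements in
# the text once u(-10000) = 0 and u(100000) = 1 are fixed (a scaling assumption):
# the 50-50 gamble gives u(10000) = .5, the 80-20 gamble gives u(50000) = .8.
UTILITY = [(-10_000, 0.0), (10_000, 0.5), (50_000, 0.8), (100_000, 1.0)]

# Hypothetical stand-in for Figure 8.1: probability of winning falls linearly
# from 1 at a bid of 190000 to 0 at a bid of 300000.
WIN_PROB = [(190_000, 1.0), (300_000, 0.0)]

COST = 200_000  # estimated cost of completing the job

def expected_utility(bid):
    """u(bid $x) = p(x) u(make $x - $200000), using u(don't get job) = 0."""
    return interpolate(WIN_PROB, bid) * interpolate(UTILITY, bid - COST)

bids = range(190_000, 300_001, 1_000)
best_bid = max(bids, key=expected_utility)
print(best_bid, round(expected_utility(best_bid), 4))
```

Replacing `WIN_PROB` and `UTILITY` with values read from the actual figures would reproduce the expected-utility curve of Figure 8.3.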

8.2  SIMPLE  PROBABILITY  MEASURES 

Definition 8.1. A simple probability measure on \(X\) is a real-valued
function \(P\) defined on the set of all subsets of \(X\) such that

1. \(P(A) \geq 0\) for every \(A \subseteq X\),

2. \(P(X) = 1\),

3. \(P(A \cup B) = P(A) + P(B)\) when \(A, B \subseteq X\) and \(A \cap B = \emptyset\),

4. \(P(A) = 1\) for some finite \(A \subseteq X\).

Property (4) distinguishes \(P\) as a simple probability measure. Chapter 10
removes this restriction and considers expected utility for more general
measures.

Property (3) is the finite additivity property: the probability of the union
of two disjoint subsets of \(X\) equals the sum of the two separate probabilities.
\(P(\{x\})\), which we shall write as \(P(x)\), is the probability assigned by \(P\) to the
unit subset \(\{x\}\) of \(X\).

THEOREM 8.1. Suppose \(P\) is a simple probability measure on \(X\). Then
\(P(x) = 0\) for all but a finite number of \(x \in X\) and, for all \(A \subseteq X\),

\[ P(A) = \sum_{x \in A} P(x). \tag{8.1} \]

Proof. Suppose \(P\) is simple and \(A\) is a finite subset of \(X\) for which \(P(A) = 1\).
Then \(P(x) = 0\) for all \(x \notin A\), for otherwise, if \(P(x) > 0\), \(P(A \cup \{x\}) > 1\)
by (3) of Definition 8.1, which by (1) and (3) then leads to \(P(X) > 1\),
contradicting (2). By successive uses of (3), (8.1) holds when \(A\) is finite. For
arbitrary \(A \subseteq X\) let \(B = \{x : x \in A, P(x) > 0\}\) and \(C = \{x : x \in A, P(x) = 0\}\).
By (3), \(P(A) = P(B) + P(C)\). Moreover, \(B\) is finite so that (8.1) holds if
\(P(C) = 0\). If \(P(C) > 0\) then, by (3), \(P(C \cup \{x : x \in X, P(x) > 0\}) > 1\) since
if \(P\{x : x \in X, P(x) > 0\} < 1\) then, by (8.1) for finite sets, \(P(D) < 1\) for every
finite \(D \subseteq X\). Hence, if \(P(C) > 0\) we find again that \(P(X) > 1\). ♦
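
Theorem 8.1 says that a simple probability measure is completely described by finitely many point masses. A minimal Python sketch of that idea follows; the function name `make_simple_measure` and the sample masses are assumptions made for illustration, and exact fractions are used so that the additivity checks are not disturbed by rounding.

```python
from fractions import Fraction

def make_simple_measure(point_masses):
    """Build P from finitely many point masses P(x); returns P as a function
    on subsets A of X (any container supporting the `in` test)."""
    masses = {x: Fraction(p) for x, p in point_masses.items()}
    support = {x: p for x, p in masses.items() if p > 0}
    if any(p < 0 for p in masses.values()) or sum(support.values()) != 1:
        raise ValueError("point masses must be nonnegative and sum to 1")

    def P(A):
        # (8.1): only the finitely many x with P(x) > 0 contribute.
        return sum((p for x, p in support.items() if x in A), Fraction(0))

    return P

P = make_simple_measure({100: "3/10", 200: "7/10"})

print(P({100}), P({100, 200}), P(range(0, 10**9)), P(set()))
```

Because only the support is ever inspected, \(A\) may be a very large set (here a `range` with a billion elements) without affecting the computation.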

Convex  Combinations  of  Measures 

In expected-utility theory we use a rule for combining two probability
measures to form a third measure. This rule can of course be extended to
the combination of any finite number of measures.

Definition 8.2. If \(P\) and \(Q\) are simple probability measures on \(X\) and
\(\alpha \in [0, 1]\) then \(\alpha P + (1 - \alpha)Q\) is the function that assigns the number
\(\alpha P(A) + (1 - \alpha)Q(A)\) to each \(A \subseteq X\).

Under the definition's hypotheses it is readily seen that \(\alpha P + (1 - \alpha)Q\)
is a simple probability measure on \(X\).

If \(P(\$100) = .3\), \(P(\$200) = .7\), \(Q(\$100) = .5\), and \(Q(\$300) = .5\) then,
with \(R = .1P + .9Q\), \(R(\$100) = .48\), \(R(\$200) = .07\), and \(R(\$300) = .45\).
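
The numerical example can be reproduced directly. In the sketch below a simple measure is represented by a dictionary of its point masses; this representation and the function name `mix` are conveniences assumed here, not notation from the text.

```python
from fractions import Fraction

def mix(alpha, P, Q):
    """Point masses of alpha*P + (1 - alpha)*Q, following Definition 8.2."""
    alpha = Fraction(alpha)
    if not 0 <= alpha <= 1:
        raise ValueError("alpha must lie in [0, 1]")
    outcomes = set(P) | set(Q)
    return {x: alpha * P.get(x, 0) + (1 - alpha) * Q.get(x, 0) for x in outcomes}

# The measures of the example, keyed by dollar amount.
P = {100: Fraction(3, 10), 200: Fraction(7, 10)}
Q = {100: Fraction(1, 2), 300: Fraction(1, 2)}

R = mix(Fraction(1, 10), P, Q)
print({x: float(p) for x, p in sorted(R.items())})
```

The masses of `R` again sum to one and are positive at only finitely many points, which is the content of the remark that the mixture is itself a simple probability measure.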

Expected  Value 

If \(P\) is a simple probability measure on \(X\) and \(f\) is a real-valued function on
\(X\) then the so-called expected value of \(f\) with respect to \(P\), written here as
\(E(f, P)\), is defined by

\[ E(f, P) = \sum_{x \in X} f(x)P(x). \tag{8.2} \]

With \(P\), \(Q\), and \(R\) as in the preceding paragraph and with \(f(x) = x\), \(E(f, P) =
\$170\), \(E(f, Q) = \$200\), and \(E(f, R) = \$197 = .1E(f, P) + .9E(f, Q)\). In
general, \(E(f, \alpha P + (1 - \alpha)Q) = \alpha E(f, P) + (1 - \alpha)E(f, Q)\).
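
A short check of these figures, and of the linearity of \(E\) in the measure, can be written as follows. As before, the dictionary representation and the names `expectation` and `mix` are assumptions of the sketch.

```python
from fractions import Fraction

def expectation(f, P):
    """E(f, P) as in (8.2); the sum runs over the x with P(x) > 0."""
    return sum(f(x) * p for x, p in P.items())

def mix(alpha, P, Q):
    """Point masses of alpha*P + (1 - alpha)*Q (Definition 8.2)."""
    outcomes = set(P) | set(Q)
    return {x: alpha * P.get(x, 0) + (1 - alpha) * Q.get(x, 0) for x in outcomes}

P = {100: Fraction(3, 10), 200: Fraction(7, 10)}
Q = {100: Fraction(1, 2), 300: Fraction(1, 2)}
alpha = Fraction(1, 10)
R = mix(alpha, P, Q)

def identity(x):
    return x

e_P, e_Q, e_R = (expectation(identity, M) for M in (P, Q, R))
print(e_P, e_Q, e_R, e_R == alpha * e_P + (1 - alpha) * e_Q)
```

The final comparison holds for any real-valued `f`, not only the identity, because each point mass of the mixture is itself a weighted sum of the point masses of `P` and `Q`.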


8.3 EXPECTED UTILITY FOR SIMPLE MEASURES

If \(\mathcal{P}_s\) is the set of all simple probability measures on \(X\) then the measures
that correspond to the strategies in the type of situation considered in this
chapter comprise a subset of \(\mathcal{P}_s\). In our preference conditions for expected
utility we shall use all distributions in \(\mathcal{P}_s\) for two related reasons. The first is
for mathematical expediency, for when \(\mathcal{P}_s\) is used it is closed under convex
combinations as defined by Definition 8.2: if \(P, Q \in \mathcal{P}_s\) and \(\alpha \in [0, 1]\) then
\(\alpha P + (1 - \alpha)Q \in \mathcal{P}_s\). The second reason concerns the estimation of utilities,
for when the theory is used as a basis for estimating \(u\) on \(X\) it is often
convenient to use measures in \(\mathcal{P}_s\) that have \(P(x) > 0\) for only one or two \(x \in X\),
and such measures may correspond to no actual strategies.

The  following  theorem  will  be  seen  to  be  a  corollary  of  a  more  general 
theorem  that  is  presented  and  proved  in  the  next  section. 

THEOREM 8.2. Suppose that \(\mathcal{P}_s\) is the set of all simple probability measures
on \(X\) and \(\prec\) is a binary relation on \(\mathcal{P}_s\). Then there is a real-valued function \(u\) on
\(X\) that satisfies

\[ P \prec Q \iff E(u, P) < E(u, Q), \quad \text{for all } P, Q \in \mathcal{P}_s \tag{8.3} \]

if and only if, for all \(P, Q, R \in \mathcal{P}_s\),

1. \(\prec\) on \(\mathcal{P}_s\) is a weak order,

2. \((P \prec Q, 0 < \alpha < 1) \Rightarrow \alpha P + (1 - \alpha)R \prec \alpha Q + (1 - \alpha)R\),

3. \((P \prec Q, Q \prec R) \Rightarrow \alpha P + (1 - \alpha)R \prec Q\) and \(Q \prec \beta P + (1 - \beta)R\) for
some \(\alpha, \beta \in (0, 1)\).

Moreover, \(u\) in (8.3) is unique up to a positive linear transformation: that is,
if \(u\) satisfies (8.3) then a real-valued function \(v\) on \(X\) satisfies \(P \prec Q \iff
E(v, P) < E(v, Q)\), for all \(P, Q \in \mathcal{P}_s\), if and only if there are numbers \(a > 0\)
and \(b\) such that

\[ v(x) = au(x) + b \quad \text{for all } x \in X. \tag{8.4} \]

Suppose we extend \(u\) to \(\mathcal{P}_s\) by defining \(u(P) = E(u, P)\). Then, if (8.3) holds,
\(P \prec Q \iff u(P) < u(Q)\). Now if \(v\) on \(\mathcal{P}_s\) is any order-preserving (not
necessarily linear) transformation of \(u\) on \(\mathcal{P}_s\), then \(P \prec Q \iff v(P) < v(Q)\). Given
such a \(v\) we can define \(v\) on \(X\) by \(v(x) = v(P)\) when \(P(x) = 1\). However, if \(v\)
is not a linear transformation of \(u\) then \(v(P) = E(v, P)\) must be false for some
\(P \in \mathcal{P}_s\). In other words there are functions \(v\) on \(\mathcal{P}_s\) that satisfy \(P \prec Q \iff
v(P) < v(Q)\) but do not satisfy \(P \prec Q \iff E(v, P) < E(v, Q)\) when \(v\) on \(X\) is
defined from \(v\) on \(\mathcal{P}_s\) in the manner indicated (provided that \(P \prec Q\) for
some \(P, Q \in \mathcal{P}_s\)).
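
A small numerical case may make the distinction concrete. In the sketch below the consequence set \(X = \{0, 1, 2\}\), the utility \(u(x) = x\), and the two measures are hypothetical choices made for illustration. The function `w` is a positive linear transformation of \(u\) as in (8.4), while `v` is what results on \(X\) from the order-preserving but nonlinear transformation \(v(P) = u(P)^3\).

```python
from fractions import Fraction

def expectation(f, P):
    """E(f, P): sum of f(x) P(x) over the x with P(x) > 0."""
    return sum(f(x) * p for x, p in P.items())

def u(x):
    # Utility on X = {0, 1, 2}; an arbitrary choice for illustration.
    return Fraction(x)

def w(x):
    # Positive linear transformation of u, as in (8.4) with a = 2, b = 5.
    return 2 * u(x) + 5

def v(x):
    # Order-preserving on X but not a linear transformation of u.
    return u(x) ** 3

P = {1: Fraction(1)}                          # consequence 1 for certain
Q = {0: Fraction(6, 10), 2: Fraction(4, 10)}  # a gamble between 0 and 2

q_below_p_under_u = expectation(u, Q) < expectation(u, P)
q_below_p_under_w = expectation(w, Q) < expectation(w, P)
q_below_p_under_v = expectation(v, Q) < expectation(v, P)
print(q_below_p_under_u, q_below_p_under_w, q_below_p_under_v)
```

Expectations under `u` and `w` rank the two measures the same way, whereas expectations under `v` reverse the ranking, even though `v` orders the sure consequences exactly as `u` does.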


Condition  1:  Weak  Order 

Condition 1, weak order, can easily be criticized for its implication of
transitive indifference. For example, let consequences be amounts of money
viewed as potential increments to one's present wealth. Let \(P(\$35) = 1\),
\(Q(\$36) = 1\), and \(R(\$0) = R(\$100) = .5\). Surely \(P \prec Q\). But it seems quite
possible that \(P \sim R\) and \(Q \sim R\), in which case \(\sim\) is not transitive.

For this reason the next chapter examines the case where \(\prec\) on \(\mathcal{P}_s\) is only
assumed to be a strict partial order. We shall not consider interval orders and
semiorders per se, as in Chapter 2, for conditions p10 and p11 of Section 2.4
are liable to criticisms of the sort given above. For example, if \(Q'(\$35.50) = 1\),
then \(P \prec Q' \prec Q\) but \(R\) might be indifferent to each of these, which would
violate p11. Moreover, if \(\prec\) on \(\mathcal{P}_s\) is assumed to be irreflexive and to satisfy
p11, and if condition 2 of Theorem 8.2 holds then \(\sim\) on \(\mathcal{P}_s\) is transitive. For
suppose to the contrary that \((P \sim Q, Q \sim R, P \prec R)\). Then, by condition
2, \(P = \tfrac{1}{2}P + \tfrac{1}{2}P \prec \tfrac{1}{2}P + \tfrac{1}{2}R\) and \(\tfrac{1}{2}P + \tfrac{1}{2}R \prec \tfrac{1}{2}R + \tfrac{1}{2}R = R\), so
that, by p11, \(P \prec Q\) or \(Q \prec R\), which contradicts \((P \sim Q, Q \sim R)\).

Condition  2:  Independence 

Condition  2,  a  form  of  independence  axiom,  is  regarded  by  many  as  the 
core  of  expected-utility  theory,  for  without  it  the  “expectation”  part  of 
expected  utility  vanishes.  Moreover,  this  condition  is  often  regarded  as  a 
principal  normative  criterion  of  the  theory,  along  with  transitivity  of  <. 

\(\alpha P + (1 - \alpha)R\) may be viewed in two ways: either as a gamble that yields
\(x \in X\) with probability \(\alpha P(x) + (1 - \alpha)R(x)\), or as a two-stage process
whereby \(P\) (or \(R\)) is selected in the first stage with probability \(\alpha\) (or \(1 - \alpha\))
and then \(x\) is selected at the second stage using the one of \(P\) and \(R\) already
selected. These two interpretations are probabilistically identical although
they are not psychologically identical. For example, you might find the
two-stage process more exciting.

As a normative criterion, \((P \prec Q, 0 < \alpha < 1) \Rightarrow \alpha P + (1 - \alpha)R \prec \alpha Q +
(1 - \alpha)R\) is usually defended with the two-stage argument. If you prefer \(Q\) to
\(P\) then it seems reasonable in view of the two-stage interpretation that you
should prefer \(\alpha Q + (1 - \alpha)R\) to \(\alpha P + (1 - \alpha)R\), or that, in the following
payoff matrix, you should prefer A to B when you have a choice between
A and B and, independent of your choice, a "coin" with probability \(\alpha\) for
"heads" and probability \(1 - \alpha\) for "tails" is flipped to determine the
appropriate column.

\[
\begin{array}{l|cc}
 & \alpha & 1 - \alpha \\ \hline
\text{Option A} & Q & R \\
\text{Option B} & P & R
\end{array}
\]




Condition 2 has several related functions as a guide in making consistent
preference judgments. First, it may help to uncover preferences between
more complex alternatives on the basis of preferences between simpler
alternatives. Suppose that, initially, a person has no clear preference between
\(R\) and \(S\) where

\[ R(\$50) = .10, \quad R(\$80) = .45, \quad R(\$100) = .45 \]

\[ S(\$0) = .02, \quad S(\$80) = .45, \quad S(\$100) = .53, \]

but definitely prefers \(Q\) to \(P\) when \(Q(\$0) = .2\), \(Q(\$100) = .8\), and \(P(\$50) = 1\).
Let \(T(\$80) = T(\$100) = .5\). In view of the fact that \(S = .1Q + .9T\) and
\(R = .1P + .9T\), his preference for \(Q\) over \(P\) may convince him that he
should prefer \(S\) to \(R\) even though he might feel that \(S\) and \(R\) are "very close
together."

Condition 2 can also be useful in uncovering inconsistencies in preference
judgments. Consider an example used by Savage (1954, pp. 101-103) that is
due to Allais (1953). Which of \(Q\) and \(P\) do you prefer?

\[ Q(\$500000) = 1; \qquad P(\$2500000) = .10, \quad P(\$500000) = .89, \quad P(\$0) = .01. \]

Also, which of \(R\) and \(S\) do you prefer?

\[ R(\$500000) = .11, \quad R(\$0) = .89; \qquad S(\$2500000) = .10, \quad S(\$0) = .90. \]

According to Allais and Savage it is not unusual to find \(P \prec Q\) and \(R \prec S\).
Now with \(T(\$2500000) = 10/11\), \(T(\$0) = 1/11\) and \(V(\$0) = 1\),

\[ Q = .11Q + .89Q \]
\[ P = .11T + .89Q \]

and

\[ R = .11Q + .89V \]
\[ S = .11T + .89V. \]

Since condition 2 implies the converse of itself in the presence of the other
conditions, \(P \prec Q \Rightarrow T \prec Q\) and \(R \prec S \Rightarrow Q \prec T\), so that an
"inconsistency" has been uncovered. In Allais' viewpoint, this result speaks against the
reasonableness of condition 2. On the other hand, Savage suggests that many
people would be alarmed at the apparent inconsistency and, accepting the
"reasonableness" of condition 2, wish to revise their initial judgments so
that the revisions are consistent with the condition.
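
The decompositions in this example, and the reason no utility function can accommodate both common preferences, can be verified numerically. The sketch below is an illustration: the dictionary representation of measures, the names `mix` and `expectation`, and the three trial utility assignments are assumptions introduced here.

```python
from fractions import Fraction as F

def mix(alpha, P, Q):
    """Point masses of alpha*P + (1 - alpha)*Q (Definition 8.2)."""
    outcomes = set(P) | set(Q)
    return {x: alpha * P.get(x, 0) + (1 - alpha) * Q.get(x, 0) for x in outcomes}

def expectation(u, P):
    """E(u, P) for a utility given as a dictionary on consequences."""
    return sum(u[x] * p for x, p in P.items())

# The six measures of the example, keyed by dollar amount.
Q = {500_000: F(1)}
P = {2_500_000: F(10, 100), 500_000: F(89, 100), 0: F(1, 100)}
R = {500_000: F(11, 100), 0: F(89, 100)}
S = {2_500_000: F(10, 100), 0: F(90, 100)}
T = {2_500_000: F(10, 11), 0: F(1, 11)}
V = {0: F(1)}

a = F(11, 100)
decompositions_hold = (
    mix(a, T, Q) == P and mix(a, Q, V) == R and mix(a, T, V) == S
)

# For any u, E(u,Q) - E(u,P) and E(u,S) - E(u,R) sum to zero, so they cannot
# both be positive: P < Q and R < S are incompatible with (8.3).
trial_utilities = [
    {0: F(0), 500_000: F(1), 2_500_000: F(2)},
    {0: F(0), 500_000: F(9), 2_500_000: F(10)},
    {0: F(-50), 500_000: F(3), 2_500_000: F(4)},
]
gap_sums = [
    (expectation(u, Q) - expectation(u, P)) + (expectation(u, S) - expectation(u, R))
    for u in trial_utilities
]
print(decompositions_hold, gap_sums)
```

Both differences reduce to \(\pm[.11u(\$500000) - .10u(\$2500000) - .01u(\$0)]\), which is why their sum vanishes for every trial utility.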

Condition  3:  An  Archimedean  Axiom 

The third condition in Theorem 8.2 says that if \(P \prec Q \prec R\) then there is
some nontrivial mixture of \(P\) and \(R\) that is less preferred than \(Q\), and also
some nontrivial mixture of \(P\) and \(R\) that is preferred to \(Q\). It specifically
prohibits the possibility that not \(\alpha P + (1 - \alpha)R \prec Q\) for all \(\alpha \in (0, 1)\), or
that not \(Q \prec \alpha P + (1 - \alpha)R\) for all \(\alpha \in (0, 1)\) when \(P \prec Q \prec R\).

Suppose that a newly minted penny will be flipped \(n\) times and that, for any
positive \(\alpha\), you feel that there is an \(n(\alpha)\) such that \(\alpha\) exceeds the probability
that every one of the \(n(\alpha)\) flips will result in a head. Consider a choice between
A and B:

A. Receive $1 regardless of the results of the \(n\) flips,

B. Be executed if every flip results in a head, and receive $2 otherwise.

If execution \(\prec\) $1 \(\prec\) $2 and if you prefer A to B regardless of how
large \(n\) is taken to be, then you violate condition 3. If the coin is flipped
100 times, then under B there is only one sequence of the more than
1,000,000,000,000,000,000,000,000,000,000 possible sequences under which
you would be executed. In view of such numbers, many people might find a
satisfactorily large value of \(n\) for which they would choose B. It is often
claimed that the willingness that many people show toward small risks such
as crossing the street or driving a car is sufficiently convincing evidence in
favor of the condition.
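
The numbers in this example are easy to confirm. The following sketch assumes a fair coin, so that the probability of \(n\) heads in a row is \((1/2)^n\); the function name `flips_needed` is introduced here for illustration.

```python
from fractions import Fraction

def flips_needed(alpha):
    """Smallest n for which (1/2)**n < alpha, assuming a fair coin."""
    alpha = Fraction(alpha)
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    n = 0
    while Fraction(1, 2 ** n) >= alpha:
        n += 1
    return n

sequences_100 = 2 ** 100  # number of distinct outcomes of 100 flips

print(sequences_100 > 10 ** 30, flips_needed(Fraction(1, 1000)))
```

For any positive \(\alpha\), however small, the loop terminates, which is the sense in which an \(n(\alpha)\) always exists.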

Despite  the  fact  that  condition  3  is  called  an  Archimedean  axiom,  it  and 
weak  order  do  not  imply  the  existence  of  a  «  on  if,  that  satisfies  P  <  Qo 
u(P )  <  u(Q).  In  other  words,  conditions  1  and  3  do  not  imply  (see  Theorem 
3.1)  that  includes  a  countable  subset  that  is  order  dense  in  ‘S s/~. 
Exercise  6  goes  into  this  further. 

Hausner (1954) considers the case where condition 3 is not assumed to
hold. To conditions 1 and 2 he adds the indifference version of condition 2,
(P ∼ Q, 0 < α < 1) ⇒ αP + (1 − α)R ∼ αQ + (1 − α)R, which as we
shall see in the next section is implied by conditions 1, 2, and 3. His axioms
imply a lexicographic form of expected utility, but the dimensionality of this
form might not be finite. In the 2-dimensional case his representation would
be P ≺ Q ⇔ (E(u₁, P), E(u₂, P)) <ᴸ (E(u₁, Q), E(u₂, Q)) where u₁ and u₂ are
real-valued functions on X and <ᴸ is defined as in (4.10).
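
The two-dimensional lexicographic form can be sketched in a few lines. The utilities u1 and u2 below are hypothetical (u1 records survival, u2 records money) and are chosen only to tie the sketch to the coin-flipping example above; the helper names are likewise illustrative. Under this assumed rule A is preferred to B for every n, which is exactly the pattern that condition 3 rules out.

```python
from fractions import Fraction

def expected_utility(u, P):
    """E(u, P): sum of u(x) * P(x) over the support of the simple measure P."""
    return sum(u[x] * p for x, p in P.items())

def lex_less(P, Q, u1, u2):
    """True when P is strictly less preferred than Q under the
    two-dimensional lexicographic rule: compare E(u1, .) first and
    use E(u2, .) only to break ties."""
    eP = (expected_utility(u1, P), expected_utility(u2, P))
    eQ = (expected_utility(u1, Q), expected_utility(u2, Q))
    return eP < eQ  # Python compares tuples lexicographically

# Hypothetical utilities (assumptions, not from the text).
u1 = {"executed": 0, "$1": 1, "$2": 1}
u2 = {"executed": 0, "$1": 1, "$2": 2}

A = {"$1": Fraction(1)}  # receive $1 for certain

def gamble_B(n):
    """Executed if all n flips are heads, $2 otherwise."""
    p_all_heads = Fraction(1, 2 ** n)
    return {"executed": p_all_heads, "$2": 1 - p_all_heads}

for n in (1, 10, 100):
    # B is less preferred than A however large n is taken to be.
    print(n, lex_less(gamble_B(n), A, u1, u2))
```

Because the first coordinate always dominates, no number of flips makes B acceptable, so a single real-valued u cannot represent this preference.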

8.4  MIXTURE  SETS 

We shall now develop and prove a theorem that is more general than
Theorem  8.2.  The  reason  for  this  is  that  the  more  general  theorem  will  be 
used  in  later  developments,  especially  in  Chapter  13.  The  generalization  uses 
Herstein  and  Milnor’s  (1953)  definition  of  a  mixture  set. 

Definition 8.3. A mixture set is a set 𝒫 and a function that assigns an
element αP + (1 − α)Q in 𝒫 to each α ∈ [0, 1] and each (P, Q) ∈ 𝒫 × 𝒫 such
that, for all P, Q ∈ 𝒫 and α, β ∈ [0, 1],

M1. 1P + 0Q = P,

M2. αP + (1 − α)Q = (1 − α)Q + αP,

M3. α[βP + (1 − β)Q] + (1 − α)Q = αβP + (1 − αβ)Q.

The 𝒫ₛ with αP + (1 − α)Q as in Definition 8.2 is a mixture set. Along
with M1 through M3 we shall use the following:

M4. αP + (1 − α)P = P,

M5. α[βQ + (1 − β)R] + (1 − α)[γQ + (1 − γ)R]
= [αβ + (1 − α)γ]Q + [α(1 − β) + (1 − α)(1 − γ)]R.

The first of these follows from M1-M3 as follows: αP + (1 − α)P =
α[1P + 0P] + (1 − α)P = α[0P + 1P] + (1 − α)P = 0P + 1P = 1P +
0P = P. The second follows easily from M1-M3 if β or γ equals 0 or 1.
Henceforth, to verify M5 for a mixture set, we suppose that β, γ ∈ (0, 1) and
that β ≤ γ for definiteness. Following Luce and Suppes (1965, p. 288):

[αβ + (1 − α)γ]Q + [α(1 − β) + (1 − α)(1 − γ)]R

= {[αβ/γ + (1 − α)]γ}Q + {1 − [αβ/γ + (1 − α)]γ}R

= [αβ/γ + (1 − α)][γQ + (1 − γ)R] + [1 − αβ/γ − (1 − α)]R   by M3

= [α(1 − β/γ)]R + [1 − α(1 − β/γ)][γQ + (1 − γ)R]   by M2

= α{(1 − β/γ)R + (β/γ)[γQ + (1 − γ)R]} + (1 − α)[γQ + (1 − γ)R]   by M3

= α{(β/γ)[γQ + (1 − γ)R] + (1 − β/γ)R} + (1 − α)[γQ + (1 − γ)R]   by M2

= α[βQ + (1 − β)R] + (1 − α)[γQ + (1 − γ)R]   by M3.
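
As a numerical sanity check (not a proof), the identities M1 through M5 can be verified for simple probability measures stored as dictionaries. The function name `mix`, the sample measures, and the weights are assumptions made for this sketch; exact fractions are used so that equality tests are not disturbed by rounding.

```python
from fractions import Fraction as F

def mix(a, P, Q):
    """Return the simple measure aP + (1 - a)Q as a dict, dropping zero-mass points."""
    out = {}
    for x in set(P) | set(Q):
        mass = a * P.get(x, 0) + (1 - a) * Q.get(x, 0)
        if mass != 0:
            out[x] = mass
    return out

# Illustrative simple measures on X = {x, y, z}.
P = {"x": F(1, 2), "y": F(1, 2)}
Q = {"y": F(1, 4), "z": F(3, 4)}
R = {"x": F(1)}
a, b, c = F(1, 3), F(2, 5), F(3, 4)   # play the roles of alpha, beta, gamma

m1 = mix(1, P, Q) == P
m2 = mix(a, P, Q) == mix(1 - a, Q, P)
m3 = mix(a, mix(b, P, Q), Q) == mix(a * b, P, Q)
m4 = mix(a, P, P) == P
m5 = mix(a, mix(b, Q, R), mix(c, Q, R)) == mix(a * b + (1 - a) * c, Q, R)
print(m1, m2, m3, m4, m5)
```

For measures the identities are immediate from arithmetic on point masses; the value of the abstract derivation is that M4 and M5 hold in any mixture set satisfying M1 through M3.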

As  a  preface  to  the  main  theorem  we  consider  a  succession  of  lemmas,  as 
incorporated  in  the  following  theorem.  Conclusion  5  of  the  theorem  is  due  to 
Jensen (1967). ∼ and ≼ are defined as in (2.2) and (2.3).

THEOREM 8.3. Suppose that 𝒫 is a mixture set and that the following hold
for all P, Q, R ∈ 𝒫:

A1. ≺ on 𝒫 is a weak order,

A2. (P ≺ Q, 0 < α < 1) ⇒ αP + (1 − α)R ≺ αQ + (1 − α)R,

A3. (P ≺ Q, Q ≺ R) ⇒ αP + (1 − α)R ≺ Q and Q ≺ βP + (1 − β)R
for some α, β ∈ (0, 1).

Then, for all P, Q, R, S ∈ 𝒫,

C1. (P ≺ Q, 0 ≤ α < β ≤ 1) ⇒ βP + (1 − β)Q ≺ αP + (1 − α)Q,

C2. (P ≼ Q, Q ≼ R, P ≺ R) ⇒ Q ∼ αP + (1 − α)R for exactly one
α ∈ [0, 1],

C3. (P ≺ Q, R ≺ S, 0 ≤ α ≤ 1) ⇒ αP + (1 − α)R ≺ αQ + (1 − α)S,

C4. (P ∼ Q, 0 ≤ α ≤ 1) ⇒ αP + (1 − α)Q ∼ P,

C5. (P ∼ Q, 0 ≤ α ≤ 1) ⇒ αP + (1 − α)R ∼ αQ + (1 − α)R.


Proofs. C1. If β < 1, βP + (1 − β)Q ≺ βQ + (1 − β)Q by A2, and
hence βP + (1 − β)Q ≺ Q by M4. If β = 1, then βP + (1 − β)Q ≺ Q by
M1. If 0 < α < β then (α/β)[βP + (1 − β)Q] + (1 − α/β)[βP + (1 − β)Q] ≺
(α/β)[βP + (1 − β)Q] + (1 − α/β)Q by A2, and hence βP + (1 − β)Q ≺
αP + (1 − α)Q by M3 and M4. If α = 0, βP + (1 − β)Q ≺ αP + (1 − α)Q
by M1 and M2.

C2. Suppose first that Q ∼ P. Then Q ∼ 1P + 0R by M1, and 1P +
0R ≺ βP + (1 − β)R for every β < 1 by C1 and M2. Then, by transitivity
(see Theorem 2.1), α = 1 is the unique α ∈ [0, 1] for which Q ∼ αP +
(1 − α)R. A symmetric proof holds if Q ∼ R (in which case α = 0). Finally,
if P ≺ Q ≺ R, the proof of Lemma 3.1 applies with the obvious notational
changes and the use of C1, A1-A3, and M2 and M3.

C3. If 0 < α < 1, αP + (1 − α)R ≺ αQ + (1 − α)R and (1 − α)R +
αQ ≺ (1 − α)S + αQ by A2.

C4. Suppose P ∼ Q and αP + (1 − α)Q ≺ P. Then αP + (1 − α)Q ≺ Q
(Theorem 2.1). Then, by C3, α[αP + (1 − α)Q] + (1 − α)[αP +
(1 − α)Q] ≺ αP + (1 − α)Q or, by M4, αP + (1 − α)Q ≺ αP + (1 − α)Q,
which is false. Similarly, not (P ∼ Q, P ≺ αP + (1 − α)Q). Hence P ∼ Q ⇒
αP + (1 − α)Q ∼ P.

C5. M1 and M2 yield the conclusion if α ∈ {0, 1}. Take (P ∼ Q,
0 < α < 1). If R ∼ P then, by C4, αP + (1 − α)R ∼ P ∼ Q ∼ αQ +
(1 − α)R, or αP + (1 − α)R ∼ αQ + (1 − α)R. Henceforth take R ≺ P
(the P ≺ R proof is similar). Then R ≺ αP + (1 − α)R by C1 and M4.
Suppose also that αP + (1 − α)R ≺ αQ + (1 − α)R. Then, by C2,
αP + (1 − α)R ∼ (1 − β)R + β[αQ + (1 − α)R] for a unique β ∈ (0, 1).
Hence αP + (1 − α)R ∼ αβQ + (1 − αβ)R by M2 and M3. Also, since
R ≺ Q, (1 − β)R + βQ ≺ Q ∼ P by C1 and M4: hence βQ + (1 − β)R ≺
P by A1 and M2: then α[βQ + (1 − β)R] + (1 − α)R ≺ αP + (1 − α)R by
A2: finally, αβQ + (1 − αβ)R ≺ αP + (1 − α)R by M3, thus contradicting
αP + (1 − α)R ∼ αβQ + (1 − αβ)R. Hence αP + (1 − α)R ≺ αQ +
(1 − α)R is false. Similarly αQ + (1 − α)R ≺ αP + (1 − α)R is false. Hence
αP + (1 − α)R ∼ αQ + (1 − α)R. ♦


The  Main  Theorem 


THEOREM 8.4. Suppose 𝒫 is a mixture set. Then A1, A2, and A3 of Theorem
8.3 hold for all P, Q, R ∈ 𝒫 if and only if there is a real-valued function u on
𝒫 such that

P ≺ Q ⇔ u(P) < u(Q), for all P, Q ∈ 𝒫,   (8.5)

u(αP + (1 − α)Q) = αu(P) + (1 − α)u(Q), for all (α, P, Q) ∈ [0, 1] × 𝒫².   (8.6)

Moreover, if u on 𝒫 satisfies (8.5) and (8.6) then a real-valued function v on 𝒫
satisfies (8.5) and (8.6) with u replaced by v if and only if there are numbers
a > 0 and b such that

v(P) = au(P) + b for all P ∈ 𝒫.   (8.7)

Theorem 8.2 results from this when 𝒫 = 𝒫ₛ and u on X is defined from u on
𝒫ₛ by u(x) = u(P) when P(x) = 1. If {x: P(x) > 0} = {x₁, ..., xₙ} then
repeated applications of (8.6) with Pᵢ ∈ 𝒫ₛ such that Pᵢ(xᵢ) = 1 give u(P) =
Σᵢ P(xᵢ)u(Pᵢ) = Σᵢ P(xᵢ)u(xᵢ) = E(u, P), so that P ≺ Q ⇔ E(u, P) <
E(u, Q) by (8.5). (8.4) follows from (8.7).

The necessity of A1, A2, and A3 for (8.5) and (8.6) is obvious. To prove
sufficiency, Part I of the following proof shows that (8.5) and (8.6) hold on
RS = {P: R ≼ P ≼ S} when R ≺ S. We assume R ≺ S for some R, S ∈ 𝒫,
for otherwise the conclusion is obvious. Part II extends (8.5) and (8.6) to all
of 𝒫. Part III verifies (8.7).

Proof, Part I. Assume that A1, A2, and A3 hold and that R ≺ S. Let
RS = {P: P ∈ 𝒫, R ≼ P ≼ S}. By C2 there is a unique number f(P) ∈ [0, 1]
for each P ∈ RS such that

P ∼ [1 − f(P)]R + f(P)S, with f(R) = 0 and f(S) = 1.   (8.8)

Suppose P, Q ∈ RS and f(P) < f(Q). Then, by C1, [1 − f(P)]R + f(P)S ≺
[1 − f(Q)]R + f(Q)S. Transitivity and (8.8) then give P ≺ Q. On the other
hand, if f(P) = f(Q) then (8.8) and transitivity imply P ∼ Q. Thus

P ≺ Q ⇔ f(P) < f(Q), for all P, Q ∈ RS.   (8.9)

If P, Q ∈ RS and α ∈ [0, 1] then αP + (1 − α)Q ∈ RS. If α ∈ {0, 1} this
follows from M1 and M2. If 0 < α < 1 then R = αR + (1 − α)R ≼ αP +
(1 − α)R = (1 − α)R + αP ≼ (1 − α)Q + αP = αP + (1 − α)Q ≼ αS +
(1 − α)Q = (1 − α)Q + αS ≼ (1 − α)S + αS = S by M4, A2 or C5, M2,
A2 or C5, M2, A2 or C5, M2, A2 or C5, and M4, in that order.

Therefore, if P, Q ∈ RS and α ∈ [0, 1] then, by (8.8),

αP + (1 − α)Q ∼ [1 − f(αP + (1 − α)Q)]R + f(αP + (1 − α)Q)S.   (8.10)

In addition, by two applications of C5,

αP + (1 − α)Q ∼ α{[1 − f(P)]R + f(P)S}
+ (1 − α){[1 − f(Q)]R + f(Q)S}

so that, by M5,

αP + (1 − α)Q ∼ [1 − αf(P) − (1 − α)f(Q)]R
+ [αf(P) + (1 − α)f(Q)]S.

From this, (8.10), transitivity, and C1 it follows that

f(αP + (1 − α)Q) = αf(P) + (1 − α)f(Q), for all (α, P, Q) ∈ [0, 1] × RS².   (8.11)

(8.9) and (8.11) verify (8.5) and (8.6) on RS.
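
The construction in Part I can be illustrated for simple measures. This sketch assumes that preference is already given by expected utility with a hypothetical u on X = {x, y, z, w}, so the weight f(P) in (8.8) has a closed form; the names `mix`, `f`, and the sample measures are illustrative, not part of the proof.

```python
from fractions import Fraction as F

def expected_utility(u, P):
    """E(u, P) for a simple measure P stored as a dict of point masses."""
    return sum(u[x] * p for x, p in P.items())

def mix(a, P, Q):
    """The measure aP + (1 - a)Q."""
    return {x: a * P.get(x, 0) + (1 - a) * Q.get(x, 0) for x in set(P) | set(Q)}

u = {"x": F(0), "y": F(1), "z": F(2), "w": F(5)}  # hypothetical utilities
R = {"x": F(1)}   # lower endpoint of the interval RS
S = {"w": F(1)}   # upper endpoint of the interval RS

def f(P):
    """Weight on S that makes [1 - f]R + fS indifferent to P, as in (8.8)."""
    eR, eS = expected_utility(u, R), expected_utility(u, S)
    return (expected_utility(u, P) - eR) / (eS - eR)

P = {"y": F(1, 2), "z": F(1, 2)}
Q = {"x": F(1, 4), "w": F(3, 4)}
a = F(1, 3)

# (8.8): P is indifferent to the R-S mixture that puts weight f(P) on S.
print(expected_utility(u, P) == expected_utility(u, mix(f(P), S, R)))
# (8.11): f is linear in mixtures.
print(f(mix(a, P, Q)) == a * f(P) + (1 - a) * f(Q))
```

In the proof the order of reasoning is reversed: f is obtained from C2 without any utility function, and linearity is then derived from C5, M5, and C1.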

Proof, Part II. To extend this to all of 𝒫, fix RS with R ≺ S, and let
RᵢSᵢ = {P: P ∈ 𝒫, Rᵢ ≼ P ≼ Sᵢ} be such that RS ⊆ RᵢSᵢ for i = 1, 2. Let
fᵢ* on RᵢSᵢ satisfy (8.5) and (8.6) for (α, P, Q) ∈ [0, 1] × RᵢSᵢ², as guaranteed
by Part I. Let fᵢ be a positive linear transformation of fᵢ* so that fᵢ(R) = 0
and fᵢ(S) = 1 for i = 1, 2. The fᵢ must satisfy (8.5) and (8.6) for (α, P, Q) ∈
[0, 1] × RᵢSᵢ².

Suppose P ∈ R₁S₁ ∩ R₂S₂. If P ∼ R or P ∼ S then f₁(P) = f₂(P) by the
definitions. Three possibilities remain as shown here with the unique element
in (0, 1) as guaranteed by C2 and strict preference:

P ≺ R ≺ S,   R ∼ (1 − α)P + αS   (8.12)

R ≺ P ≺ S,   P ∼ (1 − β)R + βS   (8.13)

R ≺ S ≺ P,   S ∼ (1 − γ)R + γP.   (8.14)

Using (8.5) and (8.6) on each of these we get, for i = 1, 2,

0 = (1 − α)fᵢ(P) + α   (α ≠ 1)   (8.12*)

fᵢ(P) = β   (8.13*)

1 = γfᵢ(P)   (γ ≠ 0)   (8.14*)

respectively, so that f₁(P) = f₂(P) in each case.

Finally, let u(P) be the common value of fᵢ(P), as assured by the foregoing,
for every interval of the form RᵢSᵢ containing P, R, and S. Since every pair
P, Q ∈ 𝒫 is in at least one such interval it follows that u is defined on all of 𝒫
and satisfies (8.5) and (8.6).

Proof, Part III. If u satisfies (8.5) and (8.6) and v satisfies (8.7) with a > 0
then v obviously satisfies (8.5) and (8.6). To go the other way, suppose v
satisfies (8.5) and (8.6) along with u. If u is constant on 𝒫 then so is v and they
are related by the positive linear transformation v(P) = u(P) + (c′ − c)
where u ≡ c, v ≡ c′. On the other hand suppose that R ≺ S for some
R, S ∈ 𝒫. With such R and S fixed let

f₁(P) = [u(P) − u(R)] / [u(S) − u(R)],   f₂(P) = [v(P) − v(R)] / [v(S) − v(R)],   for all P ∈ 𝒫.   (8.15)

Since f₁ and f₂ are positive linear transformations of u and v, both satisfy
(8.5) and (8.6). Moreover f₁(R) = f₂(R) = 0 and f₁(S) = f₂(S) = 1. If P ∼ R
or P ∼ S then f₁(P) = f₂(P). Or if (8.1k) holds then f₁(P) = f₂(P) by (8.1k*)
for k = 2, 3, 4. Hence f₁ = f₂. Then, by (8.15),

v(P) = {[v(S) − v(R)] / [u(S) − u(R)]} u(P) + v(R) − u(R)[v(S) − v(R)] / [u(S) − u(R)],

so that v is a positive linear transformation of u. ♦

8.5  SUMMARY 

When  a  decision  alternative  has  positive  probability  of  resulting  in  any 
consequence  in  a  finite  subset  of  consequences  and  the  probabilities  sum  to 
one,  then  a  simple  probability  measure  on  X  corresponds  to  the  alternative. 
Three preference conditions (weak order, independence, Archimedean) for
≺ on the set of simple probability measures imply that the utility of any
measure can be computed as the expected utility of the consequences with
respect  to  that  measure,  provided  that  the  consequence  utilities  are  defined  in 
a  manner  consistent  with  the  expected-utility  model. 

For  a  general  theory  we  defined  the  notion  of  a  mixture  set  and  applied  the 
three  conditions  to  it.  The  expected-utility  model  for  simple  probability 
measures  illustrates  one  application  of  the  general  theory.  Other  uses  of  the 
general  theory  occur  later. 


INDEX  TO  EXERCISES 

1.  Expected  net  profit.  2.  Simple  measures.  3.  Unbounded  utility.  4.  Positive  linear 
transformations.  5.  Independence  condition.  6.  Order  denseness.  7.  Independence.  8. 
Necessary  conditions.  9.  Expected  utility.  10-11.  Sequential  analysis.  12-13.  Certainty 
equivalents. 14. Pfanzagl’s “consistency” axiom. 15. Linear additivity. 16. Buying and
selling  prices. 


Exercises 

1.  Using  Figure  8.1,  sketch  a  curve  of  the  expected  net  profit  of  x,  similar  to 
Figure  8.3.  Approximately  what  x  value  maximizes  expected  net  profit?  Why  does 
this  differ  from  the  x  that  maximizes  expected  utility? 


2. Use (3) of Definition 8.1 to show that (a) P(∪ᵢ₌₁ⁿ Aᵢ) = Σᵢ₌₁ⁿ P(Aᵢ) if
Aᵢ ∩ Aⱼ = ∅ whenever i ≠ j; (b) P(A ∪ B) = P(A) + P(B) − P(A ∩ B).

3. Show that (8.3) does not imply that u is bounded.

4. Let u on X = {x, y, z, w} satisfy (8.3) with (u(x), u(y), u(z), u(w)) = (0, 1, 2, 5).
Assuming that v satisfies (8.4) compute v on X when (a) v(x) = −1, v(y) = 1;
(b) v(x) = −10, v(z) = 50; (c) v(w) = 2 and v(x) + v(y) + v(z) + v(w) = 1;
(d) v(x)v(w) = v(y)v(z) = 150.

5. Consider P and Q as defined on X by the probability matrix:

        $10    $30    $50    $100    $150
P        .2     .3     .2      .1      .2
Q        .4     .1     .1      .3      .1

Consider also two gambles for a four-ticket lottery as described in the following
payoff matrix:

               Number on drawn ticket is
               1 or 2      3        4
Gamble A        $30       $50     $150
Gamble B        $10       $100    $100

If each ticket has the same chance of being drawn, show that condition 2 of Theorem
8.2 implies P ≺ Q if A ≺ B, and Q ≺ P if B ≺ A. (Compute α and R that satisfy
P = αA′ + (1 − α)R and Q = αB′ + (1 − α)R, where A′ and B′ are the measures
for A and B.)

6. With ≺′ defined on 𝒫ₛ/∼ as in (2.4) let condition 4 be: there is a countable
subset of 𝒫ₛ/∼ that is ≺′-order dense in 𝒫ₛ/∼.

a. Show that condition 1 (of Theorem 8.2) and condition 4 do not imply
condition 3. (Define ≺ by P ≺ Q ≺ R ∼ S where P(x) = Q(y) = 1 for x, y ∈ X
and R and S are any two measures in 𝒫ₛ − {P, Q}.)

b. Show that conditions 1 and 3 do not imply condition 4. (Let X = {x, y}, let
𝒫ₛ be represented by [0, 1] where p ∈ [0, 1] is the probability assigned to x,
and let A = {p: 0 ≤ p ≤ 1/2, p is rational}, B = {p: 0 < p < 1, p is irrational},
C = {p: 1/2 < p ≤ 1, p is rational}. Define ≺ by: p ∼ q if p, q ∈ A or p, q ∈ C
or p = q; p ≺ q if (p ∈ A, q ∉ A) or (p ∉ C, q ∈ C) or (p, q ∈ B and |p − 1/2| <
|q − 1/2|) or (p, q ∈ B, p < q, and |p − 1/2| = |q − 1/2|).)

c. Show that conditions 1, 3, and 4 do not imply condition 2. (Define ≺ by
P ≺ Q ∼ T for all T in 𝒫ₛ − {P, Q} as in part a.)

d. Prove that conditions 1, 2, and 4 imply condition 3.

e. Argue that conditions 1, 2, and 3 imply condition 4. (See Theorem 3.1.)

7. Show that condition 2 is not implied by conditions 1 and 3 of Theorem 8.2
and C5 of Theorem 8.3.

8. Show that A1, A2, and A3 of Theorem 8.3 are implied by (8.5) and (8.6).

9. Give details for the assertions in the paragraph following Theorem 8.4.

10.  Consider  the  following  two  alternatives: 

Alternative  A.  One  fair  coin  is  flipped.  If  it  lands  ‘‘heads’’  you  get  steak  for  dinner 
every  night  for  the  next  three  nights;  if  it  lands  “tails”  you  get  chicken  for  dinner 
every  night  for  the  next  three  nights. 

Alternative  B.  On  each  of  the  next  three  days  a  fair  coin  is  flipped  to  determine 
whether  you  get  steak  (if  “heads”)  or  chicken  (if  “tails”)  for  dinner  that  evening. 

Let X be the set of eight triples (x₁, x₂, x₃) where xᵢ ∈ {chicken, steak} for i =
1, 2, 3 and specify P and Q on X that correspond to alternatives A and B respectively.
Can  you  think  of  any  reasonable  argument  why  P  ~  Q  ought  to  be  true?  Identify 
your  own  preference  in  this  case  and  explain  why  you  prefer  the  one  alternative 
to  the  other  if  you  are  not  indifferent.  If  you  are  indifferent,  would  you  remain 
indifferent  if  the  example  were  phrased  in  terms  of  100  nights  rather  than  three 
nights  ? 

11. Consider the following two pairs of gambles, (A, B) and (C, D):

A. Get $10 with pr. .3 or $50 with pr. .7
B. Get $0 with pr. .2 or i?U with pr. .8

C. Get $20 with pr. .9 or $70 with pr. .1
D. Get $40 with pr. .6 or $60 with pr. .4

The amounts of money are to be considered as possible increments to your wealth as
of this moment. In considering your preference between A and B the correct
interpretation of the expected-utility theory says that you should disregard C and D:
that  is,  suppose  you  have  a  choice  between  A  and  B  and  that  these  are  the  only  two 
alternatives  you  can  select  between  and  the  only  two  that  can  change  your  financial 
position  in  the  near  future.  Similarly,  disregard  A  and  B  when  you  consider  your 
preference  between  C  and  D. 

a. Now suppose you are allowed to choose either A or B and either C or D before
either of your choices is actually played out. You then have four alternatives,
say (A, C), (A, D), (B, C), and (B, D). For each of these four alternatives
specify the corresponding measure on amounts you might win. Does the theory
in this chapter imply that if A ≺ B and C ≺ D, as in the preceding paragraph,
then (B, D) will be preferred to the other three alternatives in the new situation?
Why not?

b.  Suppose  you  can  select  either  A  or  B  and  then,  after  your  selection  has  been 
played  out,  you  can  choose  either  C  or  D  and  have  this  second  choice  played 
out.  Show  that  you  have  eight  strategies  in  this  case,  one  of  which  is:  (Select 
A;  if  $10  results  then  choose  C  and  if  $50  results  then  choose  D).  Make  out 
a  table  that  identifies  the  eight  strategies  and  shows  the  probability  measure 
on  totals  you  might  win  with  each  strategy. 

12. Let x = 0 represent your present wealth. If P is a probability measure on
amounts of money that represent potential incremental additions to your present
wealth and if P ∼ $x (where $x is considered as a sure-thing addition to your
present wealth) then $x is a certainty equivalent for P. P ∼ $x means that you would
be indifferent between gambling with P and “receiving” $x as an outright gift.
Estimate your certainty equivalent for P when (a) P($0) = .5, P($10000) = .5;
(b) P($0) = .1, P($1,000,000) = .9; (c) P(−$500) = .5, P($500) = .5; (d)
P(−$100) = .2, P(−$10) = .8; (e) P($0) = 1/3, P($1000) = 1/3, P($3000) = 1/3;
(f) P($90000) = .5, P($100000) = .5.
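
The definition of a certainty equivalent can be made concrete with a small sketch. Everything specific here is an assumption chosen for illustration: the exponential utility u(x) = 1 − exp(−x/ρ), the value ρ = 1000, and the function names. The exercise itself asks for your own judgment, which need not follow any such formula.

```python
import math

RHO = 1000.0  # hypothetical risk tolerance in dollars (an assumption, not from the text)

def u(x):
    """Assumed utility of a wealth increment x."""
    return 1.0 - math.exp(-x / RHO)

def u_inverse(t):
    """Inverse of u: the sure amount whose utility is t."""
    return -RHO * math.log(1.0 - t)

def certainty_equivalent(P):
    """Sure amount $x with u(x) = E(u, P), so that P is indifferent to $x."""
    eu = sum(p * u(x) for x, p in P.items())
    return u_inverse(eu)

# Gamble (c) of the exercise: lose $500 or win $500 with equal probability.
P_c = {-500: 0.5, 500: 0.5}
ce = certainty_equivalent(P_c)
mean = sum(p * x for x, p in P_c.items())
# With a concave u the certainty equivalent lies below the expected amount.
print(ce < mean)
```

With a different assumed u the number changes, which is the point of the exercise: the certainty equivalent reflects the decision maker's utility function, not only the probabilities.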

13. (Continuation.) Estimate your certainty equivalent for each of the following
probability measures.

a. P($0) = .01, P($5000) = .99. $x ∼ P.

b. Q($0) = .99, Q($5000) = .01. $y ∼ Q.

c. R($0) = .50, R($5000) = .50. $z ∼ R.

Show that the expected-utility theory implies that R ∼ ½P + ½Q. Does this mean
that $z = ½($x + $y)? Does it mean that $z is indifferent to a 50-50 gamble between
$x and $y?

14. Let X = Re, let u satisfy (8.3) with x < y implying x ≺ y and with u on X
continuous. Pfanzagl (1959) considers an axiom which when translated into this
context reads as follows: If P(x + y) = Q(x) for all x ∈ X and if Q ∼ z with z ∈ X
then P ∼ y + z. [Thus, if P(x + y) = Q(x) for all x ∈ X and if z is the certainty
equivalent for Q then y + z is the certainty equivalent for P.]

a. Under the stated conditions Pfanzagl shows that u on X must have one of the
following three forms (unique up to a positive linear transformation):

1. u(x) = kˣ with k > 1, or

2. u(x) = −kˣ with 0 < k < 1, or

3. u(x) = x.

Show  that  each  of  these  expressions  satisfies  the  axiom  stated  above.  Plot  (1) 
with  k  =  2,  plot  (2)  with  k  =  and  plot  3. 

b. Comment on whether you think this axiom is valid for you. (Consider, for
example, your answers to parts a and f of Exercise 12.)

15. (Continuation.) Let X = Re and let the other conditions in the first sentence
of Exercise 14 hold. Show that u on X is linear [i.e., case (3) in Exercise 14] if either
(a) or (b) as follows holds with x ≠ y:

a. For all x, y ∈ X and all p ∈ [0, 1], Q ∼ P whenever Q(px + (1 − p)y) = 1
and P(x) = p, P(y) = 1 − p.

b. For all x, y ∈ X, Q ∼ P whenever Q((x + y)/2) = 1 and P(x) = P(y) = ½.

c. Give a critique of these conditions.

16. A man estimates his present wealth at $50000. Let x = 0 correspond to
his present wealth and consider possible changes of amounts $10000x in his present
wealth, as shown on Figure 8.4, where u(x) is plotted. For example, x = 2 represents
an addition of $20000 to his present wealth. We assume that u has been measured
in accord with the expected-utility model. Let A be a 50-50 gamble that pays either
$0 or $40000.

a.  Use  Figure  8.4  to  estimate  the  certainty  equivalent  of  A  (see  Exercise  12). 
Write  out  the  indifference  statement  that  defines  the  certainty  equivalent  in 




Figure  8.4  Utility  function  for  possible  changes  in  present  wealth: 
$10000  x  is  amount  of  change  (see  Exercise  8.16). 


terms of changes in present wealth, denoting the certainty equivalent by y.
[Answer: y ∼ ($40000 with pr. ½ or $0 with pr. ½).]

b. If the man is given A as a gift, what is the least amount he would sell it for?
Letting y′ denote his minimum selling price, write the indifference statement
that defines y′, and compare to the answer in (a).

c. If, instead of being given A, the man considers buying it, what is the most
he would pay for it? Letting z be the most he would pay to take possession of
A, write the indifference statement that defines z.

d.  Suppose  the  man  actually  buys  A  for  the  amount  specified  in  the  answer  to 
(c).  Will  he  then  be  willing  to  sell  it  (before  it  is  played  out)  for  the  amount 
specified  in  the  answer  to  (b)?  Why  not?  What  would  he  be  willing  to  sell  it 
for  after  buying  it? 

e. Instead of buying A for the amount specified in (c) suppose he gets it at a
bargain price, say for $15000. Having bought A for $15000, what is the
minimum amount he would sell it for? Write the defining indifference statement
with w the minimum amount.

f. Suppose the man is given A as a gift. He now is given an opportunity to buy
a second gamble, also an even-chance gamble for $0 or $40000, before A is
played out. What is the most he would be willing to pay for the second gamble?
Letting r be the most he would pay, write out the indifference statement that
defines r. (Do not make the mistake of asserting that r = y′.)
g.  Suppose  the  man  buys  A  for  $15000  and  is  then  given  an  opportunity  to  buy 
a  second  gamble  just  like  A  before  A  is  played  out.  What  is  the  most  he  would 
pay  for  this  second  gamble  ?  Let  s  be  this  amount  and  write  out  the  indifference 
statement  that  defines  s. 


Chapter  9 


EXPECTED  UTILITY  FOR 
STRICT  PARTIAL  ORDERS 


This chapter examines the important generalization of expected utility for
simple probability measures when indifference on 𝒫ₛ is not assumed to be
transitive. We shall consider the representation

P ≺ Q ⇒ E(u, P) < E(u, Q), for all P, Q ∈ 𝒫ₛ,   (9.1)

in the context where X is finite. Aumann (1962) and Kannai (1963) discuss
the difficulties that arise when X is infinite and Kannai’s paper contains
several important theorems for this case.

The utility theory in this chapter is largely due to Aumann (1962). Although
he assumes that ≼ is a quasi order (reflexive, transitive), minor revisions
make his work applicable to the case where ≺ is a strict partial order
(irreflexive, transitive).

Section 9.1 presents an expected-utility theorem and discusses its
conditions. The second section develops a support theorem for convex cones in
Reⁿ. The third section proves the utility theorem with the use of the support
theorem.

9.1  AN  EXPECTED  UTILITY  THEOREM 

In the following theorem 𝒫ₛ is the set of simple probability measures on X,
as in Chapter 8. αP + (1 − α)Q is the direct linear combination of P, Q ∈ 𝒫ₛ.
E(u, P) = Σₓ u(x)P(x).

THEOREM 9.1. Suppose that X is a finite set and that the following hold
throughout 𝒫ₛ for a binary relation ≺ on 𝒫ₛ:

1. ≺ is transitive,

2. If 0 < α < 1 then P ≺ Q ⇔ αP + (1 − α)R ≺ αQ + (1 − α)R,

3. If αP + (1 − α)R ≺ αQ + (1 − α)S for all α ∈ (0, 1] then not S ≺ R.

Then there is a real-valued function u on X that satisfies (9.1).


The three conditions in this theorem compare with the three conditions of
Theorem 8.2. The ⇒ part of condition 2 in Theorem 9.1 is condition 2 of
Theorem 8.2. The ⇐ part of condition 2, which is implied by the conditions
of Theorem 8.2, can be defended as follows. Suppose in fact that with
α ∈ (0, 1) you prefer αQ + (1 − α)R to αP + (1 − α)R. Then it seems
reasonable that this preference would depend on your feelings between P and
Q. In fact, since the presence of (1 − α)R tends to weaken the difference
between the two mixtures, the removal of (1 − α)R should make the
distinction between P and Q even clearer than that between αP + (1 − α)R
and αQ + (1 − α)R and hence it would seem reasonable that you would
prefer Q to P. In the presence of the ⇒ part of condition 2 the ⇐ part can
be written as: [α ∈ (0, 1), αP + (1 − α)R ≺ αQ + (1 − α)R] ⇒ not P ∼ Q.

The Archimedean axiom, condition 3, is slightly different from Aumann’s
axiom, which says that if R ≺ αQ + (1 − α)S for all α ∈ (0, 1] then not
S ≺ R. However, both axioms are necessary for (9.1). For example, if
αE(u, P) + (1 − α)E(u, R) < αE(u, Q) + (1 − α)E(u, S) for all α ∈ (0, 1],
then we cannot have E(u, S) < E(u, R). Therefore, condition 3 is the
“weakest” sort of Archimedean condition that can be used to obtain (9.1).

We note also that condition 3 implies that ≺ is irreflexive, and for this
reason irreflexivity does not need to be included along with transitivity in
condition 1.

Because indifference (P ∼ Q ⇔ not P ≺ Q and not Q ≺ P) is not assumed
to be transitive for Theorem 9.1, it is not true in general that u satisfying
(9.1) is unique up to a positive linear transformation.
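
A small sketch may help to see both points, intransitive indifference and non-uniqueness. It assumes a strict partial order built from two hypothetical utility functions U1 and U2 on X = {x, y, z}, with P ≺ Q exactly when both expected utilities rank Q strictly higher. This construction is an illustration chosen here, not one taken from the text.

```python
from fractions import Fraction as F

def expected_utility(u, P):
    """E(u, P) for a simple measure P stored as a dict of point masses."""
    return sum(u[x] * p for x, p in P.items())

# Two hypothetical utility functions (assumptions for this sketch).
U1 = {"x": 0, "y": 1, "z": 3}
U2 = {"x": 0, "y": 2, "z": 1}

def prec(P, Q):
    """P is strictly less preferred than Q iff both utilities say so."""
    return all(expected_utility(u, P) < expected_utility(u, Q) for u in (U1, U2))

def indifferent(P, Q):
    return not prec(P, Q) and not prec(Q, P)

P = {"x": F(1, 2), "y": F(1, 2)}
Q = {"z": F(1)}
R = {"y": F(1)}

# Indifference is not transitive: P ~ Q and Q ~ R, yet P is below R.
print(indifferent(P, Q), indifferent(Q, R), prec(P, R))

# Two one-way representations in the sense of (9.1) that are not
# positive linear transformations of each other.
u_sum = {x: U1[x] + U2[x] for x in U1}
u_alt = {x: 2 * U1[x] + U2[x] for x in U1}
for u in (u_sum, u_alt):
    print(expected_utility(u, P) < expected_utility(u, R))
```

Any positive combination of U1 and U2 gives a u for which P ≺ Q implies E(u, P) < E(u, Q), which is why uniqueness up to a positive linear transformation fails here.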


9.2  CONVEX  SETS  AND  CONES 

This section develops a theorem from which we shall be able to prove
Theorem 9.1. The new theorem states that if a convex cone C in Reⁿ satisfies
specified conditions then there is a w ∈ Reⁿ such that w · x > 0 for all x ∈ C.
We shall begin with some definitions and two well-known lemmas. In what
follows, X ≠ ∅.

A set X ⊆ Reⁿ is convex if and only if αx + (1 − α)y ∈ X whenever
x, y ∈ X and 0 ≤ α ≤ 1. The closure of a convex set X, denoted by X̄ as in
Section 5.3, is easily shown to be convex also. The topology with respect to
which closure is defined is the usual product topology 𝒰ⁿ (Sections 3.4, 5.3).
The 0 is the origin of Reⁿ.

LEMMA 9.1. If X ⊆ Reⁿ is convex and (y ∈ Reⁿ, y ∉ X̄) then there is a
w ≠ 0 in Reⁿ such that

inf {w · x: x ∈ X} > w · y.


Proof. Write x² for x · x. Since y ∉ X̄, inf {(x − y)²: x ∈ X̄} > 0, and it
follows from the definitions of closure and convexity that there is a z ∈ X̄
such that (z − y)² = inf {(x − y)²: x ∈ X̄}.
Let w = z − y. Hence w ≠ 0. With 0 < λ < 1 and x ∈ X̄, (1 − λ)z +
λx ∈ X̄. Hence ((1 − λ)z + λx − y)² ≥ (z − y)². This reduces to λ(x − z)² +
2(z − y) · (x − z) ≥ 0. Letting λ approach zero it follows that w · x ≥ w · z.
Since (z − y) · (z − y) > 0, w · z > w · y. Hence inf {w · x: x ∈ X} ≥ w · z >
w · y. ♦
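
The argument for Lemma 9.1 takes w = z − y, where z is the point of the closure nearest to y. A minimal sketch, assuming X is the closed unit square in Re² (an example chosen here because its nearest point is easy to compute by clipping coordinates):

```python
def dot(a, b):
    """Inner product of two vectors given as tuples."""
    return sum(s * t for s, t in zip(a, b))

LO, HI = (0.0, 0.0), (1.0, 1.0)   # X is the unit square (illustrative choice)

def project(y):
    """Nearest point of the box to y: clip each coordinate to [lo, hi]."""
    return tuple(min(max(t, l), h) for t, l, h in zip(y, LO, HI))

y = (2.0, 3.0)                    # a point outside the closure of X
z = project(y)                    # closest point of X to y
w = tuple(zi - yi for zi, yi in zip(z, y))

# A linear function on a box attains its minimum at a corner,
# so the infimum of w . x over X can be found by checking the corners.
corners = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)]
inf_wx = min(dot(w, c) for c in corners)

# Chain from the proof: inf {w . x} >= w . z > w . y.
print(inf_wx >= dot(w, z) > dot(w, y))
```

For a general convex set the nearest point is not available in closed form, but the same inequality chain is what the proof establishes.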

y is on the boundary of X if every open set that contains y contains a point
in X and a point not in X.

LEMMA 9.2. If X ⊆ Reⁿ is convex and y ∈ Reⁿ is on the boundary of X then
there is a w ≠ 0 in Reⁿ such that

inf {w · x: x ∈ X} = w · y.   (9.2)

Proof. Let y ∈ Reⁿ be on the boundary of convex X ⊆ Reⁿ. Let Y be an
open n-dimensional rectangle that contains y and suppose that z ∈ X̄ for all
z ∈ Y. Then, by selecting open rectangles included in Y near the corners of
Y, each of these must contain an element in X, and it follows from convexity
that there is an open set included in X that contains y. But then y is not on
the boundary of X. Hence, for every such Y, there is a point in Y that is not
in X̄. It then follows that there is a sequence y₁, y₂, ... of elements in Reⁿ
that are not in X̄ but approach y. Then, by Lemma 9.1, there is a sequence
w₁, w₂, ... of elements in Reⁿ that differ from 0, have wⱼ² = 1 for all j (after
multiplication by an appropriate positive number), and satisfy inf {wⱼ · x: x ∈
X} > wⱼ · yⱼ for j = 1, 2, .... Because wⱼ² = 1 for all j there must be a
w ∈ Reⁿ such that every open n-dimensional rectangle that contains w
contains some wⱼ. It follows that, for each x ∈ X, w · x ≥ w · y.
inf {w · x: x ∈ X} > w · y is impossible, for if this were so then w · y >
w · y. ♦

Cones 

A set X ⊆ Re^n is a cone if and only if αx ∈ X whenever x ∈ X and α > 0. A convex cone is a cone that is convex. X is a convex cone if and only if [x, y ∈ X; α, β > 0] ⇒ αx + βy ∈ X. 0 is not necessarily an element in a convex cone.

THEOREM 9.2. Suppose that C is a nonempty convex cone in Re^n and that C̄ ∩ (−C) = ∅, where −C = {x: −x ∈ C}. Then there is a w ∈ Re^n such that

w · x > 0 for all x ∈ C. (9.3)




If 0 ∈ C then 0 ∈ C̄ and 0 ∈ −C so that C̄ ∩ (−C) ≠ ∅. Hence the Archimedean condition C̄ ∩ (−C) = ∅ requires that 0 ∉ C. This condition is necessary also for (9.3), for if (9.3) holds and z ∈ C̄ ∩ (−C), then w · z < 0 by (9.3) and hence (since z ∈ C̄) w · y < 0 for some y ∈ C.

Proof of Theorem 9.2. The theorem is obviously true when n = 1. Using induction we shall assume with n ≥ 2 that the conclusion follows from the hypotheses for each m < n. Thus, let the hypotheses hold for n ≥ 2. Then, since 0 is on the boundary of C, it follows from Lemma 9.2 that there is a w ∈ Re^n with w ≠ 0 such that

w · x ≥ 0 for all x ∈ C. (9.4)

If (9.3) holds for this w, we are finished. Otherwise w · z = 0 for some z ∈ C and in this case we consider two possibilities.

1. {x: w · x > 0} ⊆ C̄. Then C̄ = {x: w · x ≥ 0} and with z ∈ C and w · z = 0, −z ∈ C̄ ∩ (−C) in violation of the Archimedean condition. Hence this case can't arise under the hypotheses.

2. There is an x ∈ Re^n such that w · x > 0 and x ∉ C̄. Let Y = {y: x · y = 0}. The dimensionality of Y is less than n since x ≠ 0 and if x_i ≠ 0 then each y ∈ Y is uniquely determined by its other n − 1 components. Also, each z ∈ Re^n is expressible in one and only one way as βx + y with β ∈ Re and y ∈ Y. Namely, z = (z · x/x²)x + [z − (z · x/x²)x], and if z = βx + y = β'x + y' with β ≠ β' then x = (y − y')/(β' − β), implying x² = x · (y − y')/(β' − β) = 0, which is false.

Continuing with Case 2 let

C_0 = {y: βx + y ∈ C for some y ∈ Y and β ∈ Re}.

C_0 ⊆ Y is clearly a nonempty convex cone. To verify that C̄_0 ∩ (−C_0) = ∅ suppose to the contrary that y ∈ C̄_0 ∩ (−C_0). Then there is a β ∈ Re such that βx − y ∈ C and since y ∈ C̄_0 there is a sequence y_1, y_2, ... in C_0 that approaches y [(y − y_i)² ≥ (y − y_{i+1})² and inf {(y − y_i)²: i = 1, 2, ...} = 0] and a sequence of numbers β_1, β_2, ... such that β_i x + y_i ∈ C for all i.

Then (β + β_i)x + y_i − y ∈ C for all i so that (β + β_i)w · x + w · (y_i − y) ≥ 0 for all i by (9.4): hence the β_i must be bounded below. The β_i must be bounded above also: otherwise there are x + (y_i/β_i) ∈ C that are arbitrarily close to x, and this contradicts x ∉ C̄. It follows that there is a λ ∈ Re such that inf {|λ − β_i|: i = 1, 2, ...} = 0, and since β_i x + y_i ∈ C for all i it follows that λx + y ∈ C̄. But then (λx + y) + (βx − y) = (λ + β)x ∈ C̄, which is false unless λ + β = 0. But if λ + β = 0 then λx + y ∈ C̄ and −λx − y ∈ C, contradicting C̄ ∩ (−C) = ∅.

Therefore C̄_0 ∩ (−C_0) = ∅. It follows from the induction hypothesis for m < n that there is a v ∈ Y such that v · y > 0 for every y ∈ C_0. Since v ∈ Y, v · x = 0 and therefore, for each z ∈ C written as z = βx + y in the C_0 format, v · z = v · (βx + y) = v · y > 0. ♦


9.3  PROOF  OF  THEOREM  9.1 

Throughout this section the hypotheses of Theorem 9.1 are assumed to hold along with P ≺ Q for some P, Q ∈ 𝒫_s, for otherwise the conclusion is obvious.

Let X have n + 1 elements, n ≥ 1, identified as x_1, x_2, ..., x_{n+1}. For each P ∈ 𝒫_s let p_i = P(x_i). Let 𝒮 = {p = (p_1, ..., p_n): p_i ≥ 0 for each i and Σ p_i ≤ 1}. Then there is a one-to-one correspondence between 𝒫_s and 𝒮 ⊆ Re^n. In terms of 𝒮 the conditions are:

1. (p ≺ q, q ≺ r) ⇒ p ≺ r,

2. If 0 < α < 1 then p ≺ q ⇔ αp + (1 − α)r ≺ αq + (1 − α)r,

3. If αp + (1 − α)r ≺ αq + (1 − α)s for all α ∈ (0, 1] then not s ≺ r.

Define D ⊆ Re^n by

D = {t: t = p − q for some p, q ∈ 𝒮 for which q ≺ p}. (9.5)

Clearly, (9.1) holds if and only if there is a w ∈ Re^n such that w · t > 0 for every t ∈ D. Some facts about D follow.

a. Suppose t ∈ D is such that t = p − q = r − s with q ≺ p. Then ½r + ½q = ½p + ½s, ½r + ½q ≺ ½r + ½p by condition 2, and therefore ½p + ½s ≺ ½r + ½p, so that s ≺ r by condition 2 (⇐). Hence if q ≺ p then s ≺ r whenever r − s = p − q.

b. Suppose t = p − q and t* = r − s are in D. Then q ≺ p and s ≺ r. Hence, by conditions 1 and 2, αq + (1 − α)s ≺ αp + (1 − α)r for any α ∈ (0, 1), and hence αt + (1 − α)t* ∈ D. Thus D is convex.

c. If t = p − q for some p, q ∈ 𝒮 then t ∈ D ⇔ αt ∈ D for all α ∈ (0, 1). This follows from condition 2.

d. If t = q − p and t* = s − r for p, q, r, s ∈ 𝒮 and if αt + (1 − α)t* ∈ D for all α ∈ (0, 1] then −t* ∉ D. To prove this observe that αt + (1 − α)t* ∈ D implies that αp + (1 − α)r ≺ αq + (1 − α)s by (a). Then, by condition 3, not s ≺ r. Hence, again using (a) with −t* = r − s, −t* ∉ D.
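
The link between a utility function u and a vector w with w · t > 0 on D can be illustrated in the easy direction. The sketch below assumes a three-element X with made-up utilities and takes the preference to be the one induced by expected utility (a special case of a strict partial order); it then checks that w_i = u(x_i) − u(x_{n+1}) is positive on every difference p − q with q less preferred than p, over a coarse grid of measures. All names and numbers are illustrative assumptions.

```python
from fractions import Fraction as F
from itertools import product

# Hypothetical utilities for X = {x_1, x_2, x_3}, so n = 2.
u = [F(0), F(1), F(3)]
n = len(u) - 1

def expected_utility(p):
    """E(u, P) for p = (p_1, ..., p_n), with p_{n+1} = 1 - sum(p)."""
    last = 1 - sum(p)
    return sum(pi * ui for pi, ui in zip(p, u)) + last * u[n]

# Candidate vector: w_i = u(x_i) - u(x_{n+1}).
w = [u[i] - u[n] for i in range(n)]

# A coarse grid of points p with p_i >= 0 and sum(p) <= 1.
grid = [F(k, 4) for k in range(5)]
S = [p for p in product(grid, repeat=n) if sum(p) <= 1]

# D restricted to the grid: t = p - q whenever q is less preferred than p.
D = [tuple(pi - qi for pi, qi in zip(p, q))
     for p in S for q in S
     if expected_utility(q) < expected_utility(p)]

all_positive = all(sum(wi * ti for wi, ti in zip(w, t)) > 0 for t in D)
print(len(D), "differences checked; w.t > 0 on all of them:", all_positive)
```

The check works because E(u, P) − E(u, Q) = w · (p − q) for this choice of w; the hard part of Theorem 9.1 is the converse, producing w from the axioms alone.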

Based on D we define a cone C as follows:

C = {x: x = αt for some α > 0 and t ∈ D}.

Since D ≠ ∅ by assumption, C ≠ ∅. The convexity of C follows easily from properties (b) and (c) for D. For the Archimedean condition we wish to have:

αt + (1 − α)t* ∈ C for all α ∈ (0, 1] ⇒ −t* ∉ C. (9.6)




This is obviously true if t* = 0. Henceforth take t* ≠ 0. If t ∈ C it is easily seen (Exercise 11) that there is a β > 0 such that βt ∈ D and (βt)² ≥ 1/n². Given αt + (1 − α)t* ∈ C for all α ∈ (0, 1], it follows that for each α ∈ (0, 1] there is a β(α) > 0 such that α(β(α)t) + (1 − α)(β(α)t*) ∈ D and β(α)²[αt + (1 − α)t*]² ≥ 1/n². Since {(αt + (1 − α)t*)²: α ∈ (0, 1]} is bounded above, it follows that β(α) ≥ δ for some δ > 0 and all α ∈ (0, 1]. Therefore there is a β > 0 such that α(βt) + (1 − α)(βt*) ∈ D for all α ∈ (0, 1]. With β such that [condition illegible in the source], it then follows from (d) that −βt* ∉ D and hence that −t* ∉ C. This verifies (9.6).

Suppose C ⊆ Re^n is actually n dimensional so that some t ∈ C is not on the boundary of C. Then there is an open n-dimensional cube in Re^n that contains such a t and is included in C. It follows with little difficulty that if z ∈ C̄ then αt + (1 − α)z ∈ C for all α ∈ (0, 1], and hence −z ∉ C by (9.6). Hence C̄ ∩ (−C) = ∅ and it follows from Theorem 9.2 that there is a w ∈ Re^n such that w · t > 0 for all t ∈ C and hence w · t > 0 for all t ∈ D. If every point in C is on the boundary of C (with respect to Re^n) then the dimensionality of C is less than n and a similar analysis applies with respect to the actual dimensionality of C. ♦

9.4  SUMMARY 

C ⊆ Re^n is a convex cone if [x, y ∈ C; α, β > 0] ⇒ αx + βy ∈ C. If C is a nonempty convex cone in Re^n, and −C and the closure of C have no point in common, then there is a w ∈ Re^n such that w_1x_1 + ··· + w_nx_n > 0 for every x in C. This result can be used to prove that if X is finite and if ≺ on 𝒫_s is a strict partial order that satisfies an appropriate independence condition and a necessary Archimedean condition, then there is a real-valued function u on X that satisfies P ≺ Q ⇒ E(u, P) < E(u, Q) for all P and Q in 𝒫_s.

INDEX  TO  EXERCISES 


1. Independence relations. 2-3. Conditions that imply transitive indifference. 4. [entry illegible in source]. 5. P ≈ Q ⇔ P = Q. 6-7. Convex sets and closure. 8. Limit point. 9-10. More boundaries. 11-12. Distance from the origin. 13. Linear additivity.


Exercises 

1. With ≺ on 𝒫_s a strict partial order let P ~ Q ⇔ (not P ≺ Q, not Q ≺ P) and P ≈ Q ⇔ (P ~ R ⇔ Q ~ R, for all R ∈ 𝒫_s), as usual. Let B1, B2, B3 be the following independence conditions:

B1. [P ≺ Q, 0 < α < 1] ⇒ αP + (1 − α)R ≺ αQ + (1 − α)R.

B2. [P ~ Q, 0 < α < 1] ⇒ αP + (1 − α)R ~ αQ + (1 − α)R.

B3. [P ≈ Q, 0 < α < 1] ⇒ αP + (1 − α)R ≈ αQ + (1 − α)R.

Express your opinions on the reasonableness of B2 and B3, show that (B1, B2) imply the converse (⇐) of each of B1, B2, and B3 (with 0 < α < 1), and construct a specific example to show that B1, B2, and B3 do not imply that ~ is transitive. Assume throughout that ≺ on 𝒫_s is a strict partial order.

2. (Continuation.) Let C1 and C2 be respectively the semiorder conditions (P ≺ Q, Q ≺ R) ⇒ (P ≺ S or S ≺ R) and (P ≺ Q, R ≺ S) ⇒ (P ≺ S or R ≺ Q). Assume that ≺ is irreflexive.

a. Construct situations that question the reasonableness of C1 and C2.

b. Show that (B1, C1) ⇒ ~ is transitive.

c. Show that (B1, B2, C2) ⇒ ~ is transitive.

3. (Continuation.) Let B4 be the condition: (P ~ R, Q ~ R, 0 < α < 1) ⇒ αP + (1 − α)Q ~ R. Show that B4 is implied by strict partial order and B1 provided that P ≺ Q or Q ≺ P. Then prove that (strict partial order, B1, B2, B4) ⇒ ~ is transitive. Can you construct a situation that questions the reasonableness of B4? If so, what is it?

4. Aumann (1962) proves that if ≼* on 𝒫_s for finite X is a quasi order (reflexive, transitive), if P ≼* Q ⇔ αP + (1 − α)R ≼* αQ + (1 − α)R whenever 0 < α < 1, and if R ≺* αP + (1 − α)Q for all α ∈ (0, 1] ⇒ not Q ≺* R, then there is a real-valued function u on X such that, for all P, Q ∈ 𝒫_s, P ≺* Q ⇒ E(u, P) < E(u, Q) and P ~* Q ⇒ E(u, P) = E(u, Q). Here P ≺* Q ⇔ (P ≼* Q, not Q ≼* P) and P ~* Q ⇔ (P ≼* Q, Q ≼* P). Now assume that ≺ on 𝒫_s is a strict partial order that satisfies B1, B2, and B3 of Exercise 1 along with R ≺ αP + (1 − α)Q for all α ∈ (0, 1] ⇒ not Q ≺ R. Defining ≼* from ≺ by P ≼* Q ⇔ (P ≺ Q or P ≈ Q), show that ≼* satisfies Aumann's conditions and hence that there is a real-valued function u on X (finite) such that (9.1) holds along with P ≈ Q ⇒ E(u, P) = E(u, Q), for all P, Q ∈ 𝒫_s.

5. Suppose X = {$1, $2, ..., $100}, with $1 ≺ $2 ≺ ··· ≺ $100. Argue that with ≈ defined as in Exercise 1, it would not be unusual to find that P ≈ Q ⇔ P = Q when ≺ on 𝒫_s is a strict partial order. Can you think of a case (with elements in X not monetary) where P ≈ Q would seem reasonable for some P, Q with P ≠ Q?

6. Prove that if X ⊆ Re^n is convex then so is X̄.

7. Show that if X ⊆ Re^n is convex and (y ∈ Re^n, y ∉ X̄) then there is a z ∈ X̄ such that (z − y)² = inf {(x − y)²: x ∈ X}.

8. Let w_j ∈ Re^n be such that w_j² = 1 for j = 1, 2, .... Prove that there is a w ∈ Re^n such that every open n-dimensional rectangle that contains w contains some w_j.



9. Describe the boundaries of the following convex sets in Re²: (a) {x: x_1² + x_2² ≤ 1}, (b) {x: x_1² + x_2² < 1}, (c) {x: 0 < x_1 < 1, 0 ≤ x_2 ≤ 1}, and (d) {x: x = (α, α) for α ∈ (0, 1]}.

10. With X a convex set in Re^n suppose that t ∈ X is not on the boundary of X. Verify that if z ∈ X̄ then αt + (1 − α)z ∈ X for all α ∈ (0, 1].

11. With D as defined by (9.5) suppose t ∈ D. Define p, q ∈ 𝒮 as follows: (p_i, q_i) = (t_i, 0) or (0, −t_i) or (0, 0) according to whether t_i > 0 or t_i < 0 or t_i = 0. Then t = p − q. Now multiply every t_i by α > 0 with α as large as possible so that Σ_{i: t_i > 0} αt_i ≤ 1 and Σ_{i: t_i < 0} αt_i ≥ −1. Then αp, αq ∈ 𝒮 and αt = αp − αq is in D. Verify that Σ_{i=1}^n (αt_i)² ≥ 1/n².

12. (Continuation.) Verify that Σ_{i=1}^n (αt_i)² ≥ 1/n.

13. Argue from the theory in this chapter that if X is the non-negative orthant of Re^n and if, for all x, y, z, w ∈ X,

a. ≺ is transitive,

b. If α ∈ (0, 1) then x ≺ y ⇔ αx + (1 − α)z ≺ αy + (1 − α)z,

c. αx + (1 − α)y ≺ αz + (1 − α)w for all α ∈ (0, 1] ⇒ not w ≺ y,

then there are real numbers λ_1, ..., λ_n such that x ≺ y ⇒ Σ_{i=1}^n λ_i x_i < Σ_{i=1}^n λ_i y_i for all x, y ∈ X. What must be true of the λ_i if (1) (x_i ≤ y_i for all i, x ≠ y) ⇒ x ≺ y? (2) (x_i < y_i for all i) ⇒ x ≺ y?


Chapter  10 


EXPECTED  UTILITY  FOR 
PROBABILITY  MEASURES 


This chapter extends the weak order expected-utility theory of Chapter 8 to more general sets of probability measures. Since the sets of measures considered are mixture sets, Theorem 8.4 will be used as a base for establishing the representation P ≺ Q ⇔ E(u, P) < E(u, Q). Conditions that go beyond those of Theorem 8.4 are required for the extensions. The primary new condition says that if a measure P is preferred to every consequence in a subset Y of consequences for which Q(Y) = 1, then Q shall not be preferred to P.

After  two  preliminary  examples,  Sections  10.2  and  10.3  develop  necessary 
background  material  on  probability  measures  and  expectations.  The  actual 
utility  theory  development  begins  in  Section  10.4. 


10.1  TWO  EXAMPLES 

In our first example, a decision maker must decide between two construction procedures, A and B, for building a bridge over a river. Procedure A will cost $150 million and B will cost $100 million. For A engineers have estimated the probability P(t) of completing the bridge by t years from now at 0 for t ≤ 2 and (t − 2)/3 for 2 ≤ t ≤ 5. For B, the probability Q(t) of completion by t years from now is estimated at 0 for t ≤ 3 and (t − 3)/4 for 3 ≤ t ≤ 7.

The decision maker's utilities for the applicable consequences are estimated according to the expected-utility model as u($150, t) = −(t − 2)² − 5 for procedure A, and as u($100, t) = −(t − 2)² for procedure B. The expected utility of A is therefore

∫_2^5 [−(t − 2)² − 5](1/3) dt = −8

and for B the expected utility is

∫_3^7 [−(t − 2)²](1/4) dt = −10.33.

Thus procedure A, more costly but faster than B, has the greater expected utility.
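
The two expected utilities can be checked numerically. This is a minimal sketch, assuming completion times are uniformly distributed with density 1/3 on [2, 5] for A and 1/4 on [3, 7] for B (the densities implied by the stated completion probabilities) and squared-delay utilities as reconstructed from the stated results; the Simpson-rule helper is an illustrative choice, not part of the text.

```python
def simpson(f, a, b, steps=100):
    """Composite Simpson rule on [a, b]; exact for polynomials of degree <= 3."""
    if steps % 2:
        steps += 1
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def u_a(t):
    """Utility of procedure A ($150 million) finishing at time t."""
    return -(t - 2) ** 2 - 5

def u_b(t):
    """Utility of procedure B ($100 million) finishing at time t."""
    return -(t - 2) ** 2

# Expected utility = integral of utility times the completion-time density.
eu_a = simpson(lambda t: u_a(t) / 3, 2, 5)
eu_b = simpson(lambda t: u_b(t) / 4, 3, 7)

print(round(eu_a, 2), round(eu_b, 2))
```

The comparison is driven by the delay penalty: B's completion time is spread over later years, and that costs more expected utility than the constant 5 subtracted for A's higher price.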

The  St.  Petersburg  Game 

The often-discussed "St. Petersburg game" from Bernoulli (1738) gives an example of a discrete probability measure. Consider a sequence of coin tosses and let α_n be the probability that a "head" occurs for the first time at the nth toss. Suppose you believe that α_n = 2^{−n} for n = 1, 2, ... and are given a choice between "Don't play" and "Pay the house $100 and get back $2^n if the first head occurs at the nth toss."

Let X be amounts of money representing changes in your present wealth. Then, with u defined on X,

Expected utility of "Don't play" = u($0)

Expected utility of "Pay and play" = Σ_{n=1}^∞ u($2^n − $100)2^{−n}.

According to the theory given later, u on X is bounded. Suppose, for example, that u(x) = x/(|x| + 10000), so that −1 < u(x) < 1 for all x. Then u($0) = 0 and Σ_{n=1}^∞ u($2^n − $100)2^{−n} < 0 so that "Don't play" has the greater expected utility.
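
The sign of the "Pay and play" sum is easy to confirm by direct summation. The sketch below assumes the bounded utility u(x) = x/(|x| + 10000) from the example and truncates the series at 200 terms; since |u| < 1, the neglected tail is smaller than 2^{-200}. The function names are illustrative.

```python
def u(x):
    """Bounded utility from the example: u(x) = x / (|x| + 10000)."""
    return x / (abs(x) + 10000)

def pay_and_play(terms=200):
    """Partial sum of sum_n u(2^n - 100) * 2^(-n), n = 1..terms."""
    return sum(u(2 ** n - 100) * 2.0 ** (-n) for n in range(1, terms + 1))

dont_play = u(0)
play = pay_and_play()

print("Don't play:", dont_play)
print("Pay and play:", play)
```

The first six terms (where 2^n < 100) are negative and carry most of the probability; the later gains are large in money but capped in utility and discounted by 2^{-n}, so they cannot offset the early losses.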

10.2  PROBABILITY  MEASURES 

Generally speaking, probability measures are defined on Boolean algebras of sets. In the following definition A^c = {x: x ∈ X, x ∉ A}, the complement of A with respect to X, and

∪_{i=1}^∞ A_i = {x: x ∈ A_i for some i ∈ {1, 2, ...}}.

Definition 10.1. A Boolean algebra 𝒜 for X is a set of subsets of X such that

1. X ∈ 𝒜,

2. A ∈ 𝒜 ⇒ A^c ∈ 𝒜,

3. A, B ∈ 𝒜 ⇒ A ∪ B ∈ 𝒜.

A σ-algebra 𝒜 for X is a Boolean algebra that satisfies

4. A_i ∈ 𝒜 for i = 1, 2, ... ⇒ ∪_{i=1}^∞ A_i ∈ 𝒜.

{∅, X} is the smallest Boolean and σ-algebra for nonempty X. The largest Boolean and σ-algebra is the set of all subsets of X. For reasons that will become clearer later we shall usually assume that {x} ∈ 𝒜 for each x ∈ X.

If X is finite then every Boolean algebra is a σ-algebra. The difference between these two arises when X is infinite and it has some effect on properties of probability measures. Some authors, such as Loève (1960), deal exclusively with σ-algebras (or "σ-fields").

If 𝒞 is an arbitrary set of subsets of X, the Boolean algebra generated by 𝒞 (minimal Boolean algebra over 𝒞) is the intersection of all Boolean algebras that include 𝒞. The σ-algebra generated by 𝒞 is the intersection of all σ-algebras that include 𝒞. It is easily verified that the intersection of a set of Boolean (σ) algebras for X is a Boolean (σ) algebra for X.

With X = {1, 2, ...} and 𝒞 = {{1}, {2}, ...}, the set of all unit subsets of X, the Boolean algebra 𝒜 generated by 𝒞 is the set of all subsets of X that are either finite or contain all but a finite number of elements in X. But 𝒜 is not a σ-algebra since it doesn't contain the set of all even, positive integers. The σ-algebra generated by 𝒞 is the set of all subsets of X.
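
For a finite X the generated Boolean algebra can be computed directly by closing a collection under complements and unions. The following is a minimal sketch under that finiteness assumption (it does not apply to the infinite example above); the function name and the sample collection are illustrative.

```python
def generated_algebra(X, C):
    """Smallest set of subsets of finite X that contains every member of C
    and is closed under complement and pairwise union."""
    X = frozenset(X)
    algebra = {X, frozenset()} | {frozenset(c) for c in C}
    changed = True
    while changed:
        changed = False
        current = list(algebra)
        for A in current:
            complement = X - A
            if complement not in algebra:
                algebra.add(complement)
                changed = True
            for B in current:
                union = A | B
                if union not in algebra:
                    algebra.add(union)
                    changed = True
    return algebra

# Two unit subsets of a four-element X: the atoms are {1}, {2}, {3, 4}.
algebra = generated_algebra({1, 2, 3, 4}, [{1}, {2}])
for member in sorted(algebra, key=lambda s: (len(s), sorted(s))):
    print(sorted(member))
```

With three atoms the algebra has 2³ = 8 members, and {3} is not among them because nothing in the generating collection separates 3 from 4.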

Let X = Re, with 𝒞 the set of all intervals in Re. The σ-algebra generated by 𝒞 is called the Borel algebra for Re, and its elements are Borel sets. There are subsets of Re that are not Borel sets: see, for example, Halmos (1950, pp. 66-72).

Throughout the rest of this chapter, 𝒜 denotes an algebra (Boolean or σ) for X.

Probability  Measures  and  Countable  Convex  Combinations 

Definition 10.2. A probability measure on 𝒜 is a real-valued function P on 𝒜 such that

1. P(A) ≥ 0 for every A ∈ 𝒜,

2. P(X) = 1,

3. [A, B ∈ 𝒜, A ∩ B = ∅] ⇒ P(A ∪ B) = P(A) + P(B).

For further definitions we shall use the standard notation

Σ_{i=1}^∞ p_i = sup {Σ_{i=1}^n p_i: n = 1, 2, ...} (10.1)

when p_i ≥ 0 for all i and Σ_{i=1}^n p_i ≤ M for some M and all n = 1, 2, .... Since Σ_{i=1}^n 2^{−i} = 1 − 2^{−n}, Σ_{i=1}^∞ 2^{−i} = 1.

Definition 10.3. If P_i is a probability measure on 𝒜 and α_i ≥ 0 for i = 1, 2, ..., and if Σ_{i=1}^∞ α_i = 1, then Σ_{i=1}^∞ α_i P_i is the function on 𝒜 that assigns the number Σ_{i=1}^∞ α_i P_i(A) to each A ∈ 𝒜.

The proof of the following lemma is left to the reader.

LEMMA 10.1. Σ_{i=1}^∞ α_i P_i as defined in Definition 10.3 is a probability measure on 𝒜.

The next definition will be used in our utility theory.

Definition 10.4. A set 𝒫 of probability measures on 𝒜 is closed under countable convex combinations if and only if Σ_{i=1}^∞ α_i P_i ∈ 𝒫 whenever P_i ∈ 𝒫 and α_i ≥ 0 for i = 1, 2, ..., and Σ_{i=1}^∞ α_i = 1.

If 𝒫 is closed under countable convex combinations then 𝒫 is a mixture set (Definition 8.3). Hence if ≺ on 𝒫 satisfies A1, A2, and A3 of Theorem 8.3 then (8.5) and (8.6) hold and u on 𝒫 is unique up to a positive linear transformation.

Countably- Additive  Probability  Measures 

Definition 10.5. A probability measure P on 𝒜 is countably additive if and only if

P(∪_{i=1}^∞ A_i) = Σ_{i=1}^∞ P(A_i) (10.2)

whenever A_i ∈ 𝒜 for i = 1, 2, ..., ∪_{i=1}^∞ A_i ∈ 𝒜 and A_i ∩ A_j = ∅ when i ≠ j.

This applies whether 𝒜 is a σ-algebra or a Boolean algebra that is not also a σ-algebra. (10.2) is an extension of Definition 10.2 (3).

Let 𝒜 be the Boolean algebra generated by 𝒞 = {{1}, {2}, ...} and let P on 𝒜 be defined on the basis of P(n) = 2^{−n} for each n ∈ X = {1, 2, ...}. P is countably additive but 𝒜 is not a σ-algebra.

Let 𝒜 be the set of all subsets of {1, 2, ...} and let P on 𝒜 be any probability measure that has P(n) = 0 for each n ∈ {1, 2, ...}. Then 𝒜 is a σ-algebra and P is not countably additive. Dubins and Savage (1965) call any measure that assigns probability 1 to a denumerable subset of X and probability 0 to every unit subset diffuse. The uniform measure on the positive integers, with P(n) = 0 for n = 1, 2, ... and P({n, 2n, 3n, ...}) = 1/n for n = 1, 2, ..., is diffuse.

Let 𝒜 be the set of all Borel sets in [0, 1], and let P be the uniform measure on 𝒜 defined on the basis of P([a, b]) = b − a when 0 ≤ a ≤ b ≤ 1. This P is a countably-additive measure on a σ-algebra.

An important property of countably-additive measures is noted in the next lemma.

LEMMA 10.2. If P on 𝒜 is countably additive, if ℬ is a countable subset of 𝒜 whose elements are weakly ordered by ⊂, and if ∪_ℬ A ∈ 𝒜 then

P(∪_ℬ A) = sup {P(A): A ∈ ℬ}. (10.3)

Proof. The conclusion is obvious if ℬ is finite. Assume then that ℬ is denumerable, enumerated as A_1, A_2, A_3, .... Let C_n = ∪_{i=1}^n A_i. Then C_1 ⊆ C_2 ⊆ C_3 ⊆ ···, ∪_ℬ A = ∪_{i=1}^∞ C_i and sup {P(A): A ∈ ℬ} = sup {P(C_n): n = 1, 2, ...}. This last equality follows from the facts that for any A ∈ ℬ there is an n such that P(C_n) ≥ P(A) and that for any C_n there is an A ∈ ℬ such that P(A) ≥ P(C_n).

Let D_1 = C_1 and D_i = C_i − C_{i−1} (set theoretic subtraction) for i = 2, 3, ..., so that ∪_{i=1}^∞ D_i = ∪_{i=1}^∞ C_i, D_i ∩ D_j = ∅ whenever i ≠ j, and C_n = ∪_{i=1}^n D_i. Then

P(∪_ℬ A) = P(∪_{i=1}^∞ D_i)   since ∪_ℬ A = ∪_{i=1}^∞ D_i

= Σ_{i=1}^∞ P(D_i)   by countable additivity

= sup {Σ_{i=1}^n P(D_i): n = 1, 2, ...}   by definition

= sup {P(C_n): n = 1, 2, ...}   by finite additivity

= sup {P(A): A ∈ ℬ}. ♦

Discrete  Probability  Measures 

Definition 10.6. A probability measure P on 𝒜 is discrete if and only if {x} ∈ 𝒜 for each x ∈ X, 𝒜 is a σ-algebra, P is countably additive and P(A) = 1 for some countable A ∈ 𝒜.

All simple measures are discrete. Nonsimple discrete measures on the set of all subsets of X = {0, 1, 2, ...} include the geometric distributions [P(n) = p(1 − p)^n, 0 < p < 1] and Poisson distributions [P(n) = e^{−λ}λ^n/n!, λ > 0]. The following lemma compares with Theorem 8.1.

LEMMA 10.3. If P on 𝒜 is discrete then P(x) = 0 for all but a countable number of x ∈ X and

P(A) = Σ_{x∈A} P(x) for all A ∈ 𝒜. (10.4)

Proof. Let A be countable with P(A) = 1. Then P(x) = 0 for every x ∈ A^c for otherwise P(A ∪ {x}) > 1 for some x ∈ A^c. (10.4) follows from (10.2) when A is countable. (10.4) holds in general if P(C) = 0 when P(x) = 0 for all x ∈ C and C ∈ 𝒜. Let D = {x: x ∈ X, P(x) > 0}. If P(D) < 1 it follows from (10.4) for countable sets that P(A) < 1 for every countable A ∈ 𝒜, a contradiction. Hence P(D) = 1. Then P(C) = 0 when C ∩ D = ∅. ♦

Lemma 10.3 shows that a discrete measure is completely described by the point probabilities P(x).
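
Because a discrete measure is determined by its point probabilities, P(A) can be computed by summing them over A. The sketch below assumes the geometric and Poisson point probabilities named above and truncates the support at 100 points, which is an approximation chosen for illustration; the helper names are hypothetical.

```python
import math

def geometric(p):
    """Point probabilities P(n) = p (1 - p)^n on {0, 1, 2, ...}."""
    return lambda n: p * (1 - p) ** n

def poisson(lam):
    """Point probabilities P(n) = exp(-lam) lam^n / n! on {0, 1, 2, ...}."""
    return lambda n: math.exp(-lam) * lam ** n / math.factorial(n)

def measure(point_prob, A):
    """P(A) as the sum of point probabilities over A, as in (10.4)."""
    return sum(point_prob(x) for x in A)

N = 100                       # truncation point (an assumption of this sketch)
support = range(N)
evens = range(0, N, 2)

geo = geometric(0.5)
poi = poisson(3.0)

total_geo = measure(geo, support)
total_poi = measure(poi, support)
p_even_geo = measure(geo, evens)

print(total_geo, total_poi, p_even_geo)
```

For the geometric case with p = 1/2 the even integers carry probability 1/(2 − p) = 2/3, which the truncated sum reproduces to within rounding error.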

Conditional  Probability  Measures 

Definition 10.7. If P on 𝒜 is a probability measure and if A ∈ 𝒜 and P(A) > 0 then the conditional measure of P given A, written P_A, is the function defined by

P_A(B) = P(B ∩ A)/P(A) for all B ∈ 𝒜. (10.5)

When P_A is well-defined, it is a probability measure on 𝒜: if P is countably additive then so is P_A. P_A(A) = 1, P_A(B) = 1 if A ⊆ B ∈ 𝒜, P_A(B) = P(B)/P(A) if B ⊆ A and B ∈ 𝒜. If A, B ∈ 𝒜 and P(A) > 0 and P(B) > 0 then

P(A)P_A(B) = P(B)P_B(A) = P(A ∩ B).

P_A(B) can be interpreted as the probability that the consequence that occurs will be in B, given that the consequence that occurs will be in A. If B ∩ A = ∅ then P_A(B) = 0.

If P(A) > 0 and P(A^c) > 0 then (note the convex combination)

P = P(A)P_A + P(A^c)P_{A^c} (10.6)

since, for any B ∈ 𝒜, P(B) = P(B ∩ (A ∪ A^c)) = P((B ∩ A) ∪ (B ∩ A^c)) = P(B ∩ A) + P(B ∩ A^c) = P(A)P(B ∩ A)/P(A) + P(A^c)P(B ∩ A^c)/P(A^c) = P(A)P_A(B) + P(A^c)P_{A^c}(B). More generally, if P(A) = 1 (with A ∈ 𝒜), if {A_1, ..., A_n} is an 𝒜-partition of A, and if I = {i: P(A_i) > 0}, then

P = Σ_I P(A_i)P_{A_i} (10.7)

since, for any B ∈ 𝒜,

Σ_I P(A_i)P_{A_i}(B) = Σ_I P(B ∩ A_i)   by (10.5)

= Σ_{i=1}^n P(B ∩ A_i)

= P(∪_{i=1}^n (B ∩ A_i))   by finite additivity

= P(B ∩ A) = P(B ∩ A) + P(B ∩ A^c) = P(B).

(10.7) holds also when P is countably additive, A ∈ 𝒜 and P(A) = 1, and {A_1, A_2, ...} is a denumerable 𝒜-partition of A and I = {i: P(A_i) > 0}.
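
The decomposition of P into conditional measures can be verified exactly on a small finite example. This is a minimal sketch, assuming a hypothetical six-point X with equal point probabilities and using exact rational arithmetic; the names P, conditional, and the sets A and B are illustrative choices.

```python
from fractions import Fraction as F

# Hypothetical finite example: six equally likely consequences.
X = frozenset(range(1, 7))
point = {x: F(1, 6) for x in X}

def P(A):
    """P(A) as a sum of point probabilities."""
    return sum((point[x] for x in A), F(0))

def conditional(A):
    """Return the conditional measure P_A of (10.5); requires P(A) > 0."""
    pa = P(A)
    if pa == 0:
        raise ValueError("P(A) must be positive")
    return lambda B: P(frozenset(B) & frozenset(A)) / pa

A = frozenset({1, 2})
Ac = X - A
B = frozenset({2, 3, 4})

P_A, P_Ac = conditional(A), conditional(Ac)

# Right-hand side of (10.6) evaluated at B.
mixture = P(A) * P_A(B) + P(Ac) * P_Ac(B)

print("P(B) =", P(B), " mixture =", mixture, " P_A(A) =", P_A(A))
```

Using Fraction avoids rounding, so the equality between P(B) and the convex combination holds exactly rather than approximately.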

The  following  definition  will  be  used  in  our  utility  development. 

Definition 10.8. A set 𝒫 of probability measures on 𝒜 is closed under the formation of conditional probabilities if and only if [P ∈ 𝒫, A ∈ 𝒜, P(A) > 0] ⇒ P_A ∈ 𝒫.

10.3  EXPECTATIONS 

This section defines precisely the expected value E(f, P) of a bounded, real-valued function f on X with respect to a probability measure P on 𝒜. In general we shall assume that f is 𝒜-measurable.

Definition 10.9. f is 𝒜-measurable if and only if f is a real-valued function on X and {x: f(x) ∈ I} ∈ 𝒜 for every interval I ⊆ Re.

𝒜-measurable functions are sometimes called random variables, but it is more common to use this term for functions f on X for which {x: f(x) ∈ B} ∈ 𝒜 for every Borel set B ⊆ Re.

To define expectation we begin with simple 𝒜-measurable functions.

Definition 10.10. An 𝒜-measurable function f is simple if and only if {f(x): x ∈ X} is finite. If f is simple and takes on n distinct values c_1, ..., c_n with f(x) = c_i for all x ∈ A_i then each A_i ∈ 𝒜 by Definition 10.9 and {A_1, ..., A_n} is a partition of X: with P a probability measure on 𝒜, we then define

E(f, P) = Σ_{i=1}^n c_i P(A_i). (10.8)

Simple 𝒜-measurable functions are bounded. In general, an 𝒜-measurable function f is bounded if and only if there are numbers a and b for which a ≤ f(x) ≤ b for all x ∈ X. In defining E(f, P) for any bounded, 𝒜-measurable f we shall use

Definition 10.11. A sequence f_1, f_2, ... of simple 𝒜-measurable functions converges uniformly from below to an 𝒜-measurable function f if and only if, for all x ∈ X,

1. f_1(x) ≤ f_2(x) ≤ ···

2. f(x) = sup {f_n(x): n = 1, 2, ...}

3. For any ε > 0 there is a positive integer n (which may depend on ε) such that f(x) ≤ f_n(x) + ε.

For any bounded, 𝒜-measurable f there is a sequence of simple 𝒜-measurable functions that converges uniformly from below to f. With X = {x: x ∈ X and a ≤ f(x) ≤ b} and f 𝒜-measurable let

A_{1,n} = {x: a ≤ f(x) ≤ a + (b − a)/n}

A_{i,n} = {x: a + (i − 1)(b − a)/n < f(x) ≤ a + i(b − a)/n}   i = 2, ..., n, (10.9)

and define f_n by

f_n(x) = a + (i − 1)(b − a)/n for all x ∈ A_{i,n}, i = 1, ..., n. (10.10)

Each A_{i,n} ∈ 𝒜 by Definition 10.9 and therefore each f_n is a simple 𝒜-measurable function. Conditions 1 and 2 of Definition 10.11 are easily verified and condition 3 holds with n ≥ (b − a)/ε.
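
The step functions of (10.9) and (10.10) can be built explicitly for a small example. The sketch below assumes a ten-point X with equal point probabilities and f(x) = x/(1 + x), and it evaluates f_n only for n = 1, 2, 4, 8, 16 so that each partition refines the previous one (a choice of this sketch that makes the monotonicity in condition 1 immediate). All names are illustrative.

```python
from fractions import Fraction as F
import math

def cell_index(value, a, b, n):
    """Index i in 1..n of the cell of (10.9) that contains value."""
    if value == a:
        return 1
    # Smallest i with value <= a + i (b - a) / n.
    return max(1, math.ceil((value - a) * n / (b - a)))

def f_n(f, a, b, n):
    """Simple function of (10.10): the lower endpoint of the cell of f(x)."""
    def approx(x):
        i = cell_index(f(x), a, b, n)
        return a + (i - 1) * (b - a) / n
    return approx

# Hypothetical finite example with equal point probabilities.
X = range(10)
P = {x: F(1, 10) for x in X}
f = lambda x: F(x, 1 + x)
a, b = F(0), F(1)

def expectation(g):
    """E(g, P) for a function g on the finite set X."""
    return sum(g(x) * P[x] for x in X)

levels = [1, 2, 4, 8, 16]
approximations = [f_n(f, a, b, n) for n in levels]
expectations = [expectation(g) for g in approximations]

for n, e in zip(levels, expectations):
    print(n, float(e))
print("E(f, P) =", float(expectation(f)))
```

Each f_n lies below f by at most (b − a)/n at every point, so the expectations climb toward E(f, P) and the gap after n = 16 is at most 1/16.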




Definition 10.12. If f is bounded and 𝒜-measurable and if P is a probability measure on 𝒜 then

E(f, P) = sup {E(f_n, P): n = 1, 2, ...} (10.11)

where f_1, f_2, ... is any sequence of simple 𝒜-measurable functions that converges uniformly from below to f.

The following lemma notes that E(f, P) is well defined.

LEMMA 10.4. If f_1, f_2, ... and g_1, g_2, ... are sequences of simple 𝒜-measurable functions that converge uniformly from below to a bounded, 𝒜-measurable function f then sup {E(f_n, P): n = 1, 2, ...} is finite and

sup {E(f_n, P): n = 1, 2, ...} = sup {E(g_n, P): n = 1, 2, ...}. (10.12)

Proof. Boundedness assures a finite sup. To verify (10.12) assume to the contrary that sup {E(f_n, P): n = 1, 2, ...} < sup {E(g_n, P): n = 1, 2, ...}. Then there is an ε > 0 and a positive integer m such that

E(f_n, P) + ε < E(g_m, P) for n = 1, 2, .... (10.13)

By condition 3 of Definition 10.11 there is a k such that f(x) ≤ f_k(x) + ε for all x ∈ X, so that E(h, P) ≤ E(f_k + ε, P) for every simple 𝒜-measurable h for which h(x) ≤ f(x) for all x ∈ X. In particular, E(g_m, P) ≤ E(f_k + ε, P) = E(f_k, P) + ε, contradicting (10.13). ♦

E(f, P_A) for a well-defined conditional probability measure P_A is defined as above since P_A is a probability measure on 𝒜.

Finite  versus  Countable  Additivity 

The uniformity condition 3 of Definition 10.11 is superfluous for defining $E(f, P)$ when $P$ is countably additive and $\mathcal{A}$ is a $\sigma$-algebra (Exercise 15). But uniform convergence is required when countable additivity is not assumed to hold. The following illustrates what amounts to the failure of (10.3) for a diffuse measure.

Let $X = \{0, 1, 2, \ldots\}$, let $\mathcal{A}$ be the set of all subsets of $X$ (a $\sigma$-algebra), let $P$ be any probability measure on $\mathcal{A}$ that has $P(x) = 0$ for all $x \in X$, and let $f(x) = x/(1 + x)$ for all $x \in X$.

Since $0 \le f(x) < 1$ on $X$ we can let $a = 0$ and $b = 1$ in (10.9) and (10.10) to obtain

$$E(f_n, P) = \sum_{i=1}^{n} [(i - 1)/n] P(A_{i,n}) = (n - 1)/n \quad \text{for } n = 1, 2, \ldots,$$

since $A_{i,n}$ is a finite set for all $i < n$ and therefore, by finite additivity, $P(A_{i,n}) = 0$ for all $i < n$. Since $f_1, f_2, \ldots$ converges uniformly from below to $f$, $E(f, P) = 1$.

Now consider a sequence $g_1, g_2, \ldots$ that converges from below to $f$, but not uniformly. In particular let $B_{1,n} = [0, 1/n] \cup ((n - 1)/n, 1)$ and $B_{i,n} = ((i - 1)/n, i/n] = A_{i,n}$ for $i = 2, \ldots, n - 1$ (each interval standing for the set of $x \in X$ whose value $f(x)$ lies in it), and define $g_n$ by

$$g_n(x) = \inf \{f(y) : y \in B_{i,n}\} \quad \text{for all } x \in B_{i,n},\ i = 1, \ldots, n - 1.$$

Conditions 1 and 2 of Definition 10.11 hold for $g_1, g_2, \ldots$. But

1. $\sup \{E(g_n, P) : n = 1, 2, \ldots\} \ne E(f, P)$ since $E(g_n, P) = 0$;

2. Uniform convergence fails since for each $n$ there are values of $x$ for which $f(x) - g_n(x)$ is arbitrarily close to 1;

3. (10.3) of Lemma 10.2 fails since, with $\mathcal{B} = \{\{x : 0 \le f(x) < c\} : 0 \le c < 1\}$, $P(\bigcup_{\mathcal{B}} A) = P(X) = 1$ and $\sup \{P(A) : A \in \mathcal{B}\} = 0$.

10.4 PREFERENCE AXIOMS AND BOUNDED UTILITIES

Because a number of conditions will be used in the theorems that follow we shall first summarize most of these conditions. In all cases, $\mathcal{A}$ is a Boolean algebra for $X$ and $\mathcal{P}$ is a set of probability measures on $\mathcal{A}$. No notational distinction will be made between $x \in X$ and the one-point measure that assigns probability 1 to $x$. With $\prec$ defined on $\mathcal{P}$, $x \prec y \Leftrightarrow P \prec Q$ when $P(x) = Q(y) = 1$. Similar meanings hold for $x \prec P$, $x \preceq P$, and so forth. As usual $P \preceq Q \Leftrightarrow (P \prec Q \text{ or } P \sim Q)$, with $P \sim Q \Leftrightarrow (\text{not } P \prec Q, \text{ not } Q \prec P)$.

We  list  first  some  primarily  “structural”  conditions. 

S1. $\{x\} \in \mathcal{A}$ for every $x \in X$.

S2. $\{x : x \in X, x \prec y\} \in \mathcal{A}$ and $\{x : x \in X, y \prec x\} \in \mathcal{A}$ for every $y \in X$.

S3. $\mathcal{P}$ contains every one-point probability measure.

S4. $\mathcal{P}$ is closed under countable convex combinations (Definition 10.4).

S5. $\mathcal{P}$ is closed under the formation of conditional probabilities (Definition 10.8).

Conditions S1 and S3 enable us to define a utility function on $X$, and S2, which looks very much like some topological axioms of former chapters (see Theorems 3.5 and 5.5), guarantees that $u$ on $X$ is $\mathcal{A}$-measurable. In the present context $\mathcal{A}$ could be the set of all subsets of $X$ (i.e., the discrete topology for $X$) and no problems would result. At worst we might have to deny countable additivity. On the other hand, the use of the discrete topology, which in general implies that $(X, \mathcal{T})$ is not connected, would have disastrous effects on former theory.

The following preference axioms (in addition to S2) include the three conditions of Chapter 8 along with three versions of a kind of dominance axiom. It is to be understood that these conditions apply to all $P, Q, R \in \mathcal{P}$, $A \in \mathcal{A}$, and $y, z \in X$.




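A1. $\prec$ on $\mathcal{P}$ is a weak order.

A2. $(P \prec Q,\ 0 < \alpha < 1) \Rightarrow \alpha P + (1 - \alpha)R \prec \alpha Q + (1 - \alpha)R$.

A3. $(P \prec Q,\ Q \prec R) \Rightarrow \alpha P + (1 - \alpha)R \prec Q$ and $Q \prec \beta P + (1 - \beta)R$ for some $\alpha, \beta \in (0, 1)$.

A4a. $(P(A) = 1,\ Q \prec x \text{ for all } x \in A) \Rightarrow Q \preceq P$. $(P(A) = 1,\ x \prec R \text{ for all } x \in A) \Rightarrow P \preceq R$.

A4b. $(P(A) = 1,\ y \preceq x \text{ for all } x \in A) \Rightarrow y \preceq P$. $(P(A) = 1,\ x \preceq z \text{ for all } x \in A) \Rightarrow P \preceq z$.

A4c. $(P(A) = 1,\ y \prec x \text{ for all } x \in A) \Rightarrow y \preceq P$. $(P(A) = 1,\ x \prec z \text{ for all } x \in A) \Rightarrow P \preceq z$.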

The final three conditions are weak versions of the following translation of Savage's P7 (1954, p. 77): $(P(A) = 1,\ Q \preceq x \text{ for all } x \in A) \Rightarrow Q \preceq P$, and $(P(A) = 1,\ x \preceq R \text{ for all } x \in A) \Rightarrow P \preceq R$. Axiom A4a is weaker (assumes less) than this since it replaces $\preceq$ by $\prec$ in the hypotheses. A4c is weaker than A4b for the same reason. A4b is weaker than the translation of P7 since it deals only with one-point measures in part. Under S3, (P7 translation) $\Rightarrow$ (A4a, A4b, A4c), A4a $\Rightarrow$ A4c, and A4b $\Rightarrow$ A4c. Axiom A4b does not generally imply A4a, as can be seen from the proof of Theorem 10.2 in the next section. However, under the other hypotheses given above (S1 through A3), A4b $\Rightarrow$ A4a when every $P \in \mathcal{P}$ is countably additive: this follows easily from Theorem 10.3. Under conditions S1 through A3, A4a $\Rightarrow$ A4b.

In  general,  the  dominance  or  sure-thing  conditions  A4a,  A4b,  and  A4c 
seem  reasonable,  although  A4b  might  be  liable  to  criticism  in  the  case  where 
indifference  is  not  transitive. 

Bounded  Consequence  Utilities 

The first result based on the new dominance conditions uses the weakest one of A4a, A4b, and A4c. We know from Theorem 8.4 that (10.14) and (10.15) follow from A1, A2, A3, and S4.

LEMMA 10.5. Suppose that there is a real-valued function $u$ on $\mathcal{P}$ for which

$$P \prec Q \Leftrightarrow u(P) < u(Q), \quad \text{for all } P, Q \in \mathcal{P}, \tag{10.14}$$

$$u(\alpha P + (1 - \alpha)Q) = \alpha u(P) + (1 - \alpha)u(Q), \quad \text{for all } (\alpha, P, Q) \in [0, 1] \times \mathcal{P}^2, \tag{10.15}$$

and suppose that S1, S3, S4, and A4c hold. Then, with $u(x) = u(P)$ when $P(x) = 1$, $u$ on $X$ is bounded.

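Proof. Under the hypotheses, suppose $u$ on $X$ is unbounded above. Then there are $x_1, x_2, \ldots$ such that $u(x_i) \ge 2^i$ for $i = 1, 2, \ldots$. By S4, $\sum_{i=1}^{\infty} 2^{-i} x_{n+i} \in \mathcal{P}$ for $n = 0, 1, 2, \ldots$. By the easy extension of (10.15)

$$u\Big(\sum_{i=1}^{\infty} 2^{-i} x_i\Big) = \sum_{i=1}^{n} 2^{-i} u(x_i) + 2^{-n} u\Big(\sum_{i=1}^{\infty} 2^{-i} x_{n+i}\Big)$$

so that, since $u(x_i) \ge 2^i$,

$$u\Big(\sum_{i=1}^{\infty} 2^{-i} x_i\Big) \ge n + 2^{-n} u\Big(\sum_{i=1}^{\infty} 2^{-i} x_{n+i}\Big).$$

Since $y \prec x_i$ for all $i$ greater than some $m$ and for some $y \in X$, A4c yields $y \preceq \sum_{i=1}^{\infty} 2^{-i} x_{n+i}$ for every $n \ge m$. Therefore, by (10.14),

$$u\Big(\sum_{i=1}^{\infty} 2^{-i} x_i\Big) \ge n + 2^{-n} u(y) \quad \text{for } n = m + 1, m + 2, \ldots.$$

But this is false since $u(\sum_{i=1}^{\infty} 2^{-i} x_i)$ is a real number. Boundedness below is established by a symmetric contradiction. ♦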
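
The arithmetic behind this contradiction can be checked directly. The Python sketch below takes the smallest values the proof allows, $u(x_i) = 2^i$ (an assumption made only for the illustration), and shows that the first $n$ terms of the mixture already contribute $n$ to the utility while the remaining tail carries probability $2^{-n}$. The function names are hypothetical.

```python
from fractions import Fraction


def head_sum(n):
    """Sum_{i=1}^n 2^{-i} u(x_i) when u(x_i) = 2^i, the smallest allowed value."""
    return sum(Fraction(1, 2**i) * 2**i for i in range(1, n + 1))


def head_weight(n):
    """Total probability 2^{-1} + ... + 2^{-n} placed on x_1, ..., x_n."""
    return sum(Fraction(1, 2**i) for i in range(1, n + 1))


for n in (1, 5, 10, 20):
    tail_weight = 1 - head_weight(n)  # equals 2^{-n}
    print(n, head_sum(n), tail_weight)
```

Because the head contribution grows without bound in $n$ while the tail term is bounded below through A4c, no single real number can serve as the utility of the mixture.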

10.5 THEOREMS

By Theorem 8.4, the hypotheses of each theorem in this section imply the existence of a real-valued function $u$ on $\mathcal{P}$ that satisfies (10.14) and (10.15) and is unique up to a positive linear transformation. As shown by Lemma 10.5, $u$ on $X$ is bounded. The question then is whether $u(P) = E(u, P)$ for all $P \in \mathcal{P}$, which is true if and only if there is a real-valued function $u$ on $X$ such that

$$P \prec Q \Leftrightarrow E(u, P) < E(u, Q), \quad \text{for all } P, Q \in \mathcal{P}. \tag{10.16}$$

In the following theorems $u$ is presumed to satisfy (10.14) and (10.15). These theorems show the weakest one of A4a, A4b, and A4c that will yield (10.16) for various $\mathcal{P}$ sets. The $\nRightarrow$ means "do not imply for all possible cases." $H$ = {S1, S2, S3, S4, S5, A1, A2, A3}.

THEOREM 10.1. ($H$, A4a) $\Rightarrow$ (10.16).

THEOREM 10.2. ($H$, A4b) $\nRightarrow$ (10.16).

THEOREM 10.3. ($H$, A4b, every $P$ is countably additive) $\Rightarrow$ (10.16).

THEOREM 10.4. ($H$, A4c, every $P$ is countably additive, $x \prec y$ for some $x, y \in X$) $\nRightarrow$ (10.16).

THEOREM 10.5. ($H$, A4c, every $P$ is discrete, $x \prec y$ for some $x, y \in X$) $\Rightarrow$ (10.16).

THEOREM 10.6. ($H$, A4c, every $P$ is discrete) $\nRightarrow$ (10.16).

The  three  “positive”  theorems,  Theorems  10.1, 10.3,  and  10.5  are  proved  in 
the  next  section.  The  proofs  of  the  three  “negative”  theorems  are  given  in 
this  section  with  specific  cases  where  the  hypotheses  hold  and  (10.16)  fails. 
These  proofs  illustrate  some  of  the  differences  between  measures  that  are  not 
countably  additive  and  those  that  are,  and  between  countably  additive 
measures  that  are  not  discrete  and  those  that  are. 

Proof of Theorem 10.2. Let $X = \{0, 1, 2, \ldots\}$ with $u(x) = x/(1 + x)$ for all $x \in X$. Let $\mathcal{P}$ be the set of all probability measures on the set of all subsets of $X$ and define $u$ on $\mathcal{P}$ by

$$u(P) = E(u, P) + \inf \{P(u(x) \ge 1 - \varepsilon) : 0 < \varepsilon \le 1\}.$$

The expression $P(u(x) \ge 1 - \varepsilon)$ is a common shortening of $P(\{x : u(x) \ge 1 - \varepsilon\})$. Define $\prec$ on $\mathcal{P}$ by $P \prec Q \Leftrightarrow u(P) < u(Q)$ so that (10.14) holds. By Exercises 6, 7, 8, and 18, (10.15) holds since

$$\begin{aligned}
u(\alpha P + (1 - \alpha)Q) &= E(u, \alpha P + (1 - \alpha)Q) \\
&\quad + \inf \{\alpha P(u(x) \ge 1 - \varepsilon) + (1 - \alpha)Q(u(x) \ge 1 - \varepsilon) : 0 < \varepsilon \le 1\} \\
&= \alpha E(u, P) + (1 - \alpha)E(u, Q) + \alpha \inf \{P(u(x) \ge 1 - \varepsilon) : 0 < \varepsilon \le 1\} \\
&\quad + (1 - \alpha) \inf \{Q(u(x) \ge 1 - \varepsilon) : 0 < \varepsilon \le 1\} \\
&= \alpha u(P) + (1 - \alpha)u(Q).
\end{aligned}$$

$H$ then follows from Theorem 8.4, and A4b holds: if $P(A) = 1$ and $y \preceq x$ for all $x \in A$ then $u(y) \le u(P)$ since $u(y) \le E(u, P)$; if $P(A) = 1$ and $x \preceq z$ for all $x \in A$ then $u(P) \le u(z)$ since $u(z) < 1 - \varepsilon$ for some $\varepsilon > 0$ and therefore $\inf \{P(u(x) \ge 1 - \varepsilon) : 0 < \varepsilon \le 1\} = 0$.

Let $P$ be diffuse with $P(x) = 0$ for all $x \in X$. Then $u(P) = 1 + 1 = 2$ since $\inf \{P(u(x) \ge 1 - \varepsilon) : 0 < \varepsilon \le 1\} = 1$. Hence $u(P) \ne E(u, P)$. ♦

Proof of Theorem 10.4. Let $X = [0, 1]$, let $\mathcal{B}$ be the set of all Borel sets in $[0, 1]$. Take $\mathcal{P}$ as the set of countably-additive measures on $\mathcal{B}$. Set $u(x) = -1$ if $x < \tfrac{1}{2}$ and $u(x) = 1$ if $x \ge \tfrac{1}{2}$, and let

$$u(P) = \sum_{x} u(x)P(x), \quad \text{for all } P \in \mathcal{P}. \tag{10.17}$$

$u$ on $\mathcal{P}$ is well defined since $P(x) > 0$ for no more than a countable number of $x \in X$. Define $P \prec Q \Leftrightarrow u(P) < u(Q)$. Then (10.15) follows easily from (10.17). The conditions in $H$ hold and A4c holds since

1. $(P(A) = 1,\ y \prec x \text{ for all } x \in A) \Rightarrow A \subseteq [\tfrac{1}{2}, 1]$, $y \in [0, \tfrac{1}{2})$, and therefore $-1 = u(y) < 0 \le u(P)$, and

2. $(P(A) = 1,\ x \prec z \text{ for all } x \in A) \Rightarrow A \subseteq [0, \tfrac{1}{2})$, $z \in [\tfrac{1}{2}, 1]$, and therefore $u(P) \le 0 < u(z) = 1$.

But with $Q$ the uniform measure on $[\tfrac{1}{2}, 1]$, $0 = u(Q) \ne E(u, Q) = 1$. ♦
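
The gap between (10.17) and the expectation can be made concrete with a small Python sketch. It represents a measure as a set of atoms plus some weight spread uniformly over $[\tfrac{1}{2}, 1]$; this representation, the function names, and the mixture `R` are assumptions introduced only for illustration. The atom sum of (10.17) ignores the uniform part, whereas the expectation credits it with utility 1.

```python
from fractions import Fraction


def u(x):
    """Consequence utility from the proof: -1 below 1/2, +1 from 1/2 upward."""
    return -1 if x < Fraction(1, 2) else 1


def atom_utility(atoms):
    """u(P) of (10.17): sum of u(x) P(x) over the atoms of P only."""
    return sum(p * u(x) for x, p in atoms.items())


def expected_utility(atoms, uniform_weight):
    """E(u, P) for P = atoms + uniform_weight * (uniform measure on [1/2, 1]).

    u equals 1 everywhere on [1/2, 1], so the uniform part contributes
    uniform_weight * 1 to the expectation.
    """
    return atom_utility(atoms) + uniform_weight * 1


# Q: the uniform measure on [1/2, 1] (no atoms at all).
q_atoms, q_uniform = {}, Fraction(1)
# R: a hypothetical mixture, half on the point 1/4 and half uniform.
r_atoms, r_uniform = {Fraction(1, 4): Fraction(1, 2)}, Fraction(1, 2)

print(atom_utility(q_atoms), expected_utility(q_atoms, q_uniform))
print(atom_utility(r_atoms), expected_utility(r_atoms, r_uniform))
```

For `Q` the two quantities are 0 and 1, which is the failure of (10.16) noted at the end of the proof.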

Proof of Theorem 10.6. Let $X = \{1, 2, \ldots\}$, let $\mathcal{A}$ be the set of all subsets of $X$ and let $\mathcal{P}$ be the set of all discrete probability measures on $\mathcal{A}$. Let $\mathcal{S}$ be the set of subsets of $\mathcal{P}$ defined by

$\mathcal{S} = \{S : S \subseteq \mathcal{P}$; if $P_1, \ldots, P_n, Q_1, \ldots, Q_m$ are all different measures in $S$ and if $\alpha_i \ge 0$, $\beta_j \ge 0$ and $\sum_i \alpha_i = \sum_j \beta_j = 1$ then $\sum_i \alpha_i P_i \ne \sum_j \beta_j Q_j$; $S$ contains all one-point measures$\}$.

A simple measure is in $S \in \mathcal{S}$ only if it is a one-point measure. The measures in any $S \in \mathcal{S}$ are independent with respect to finite convex combinations. A maximal independent subset is an $S^* \in \mathcal{S}$ such that $S^* \subset S$ for no $S \in \mathcal{S}$ and, if $P \in \mathcal{P}$ and $P \notin S^*$ then there are positive numbers $\alpha_1, \ldots, \alpha_n, \beta_1, \ldots, \beta_m$ with $\sum \alpha_i = \sum \beta_j = 1$ and distinct measures $P_2, \ldots, P_n, Q_1, \ldots, Q_m \in S^*$ such that

$$\alpha_1 P + \sum_{i=2}^{n} \alpha_i P_i = \sum_{j=1}^{m} \beta_j Q_j. \tag{10.18}$$

Using Zorn's Lemma (Section 2.3) it is easily shown that $\mathcal{S}$ has a maximal element $S^*$. It can be shown also, but is tedious algebraically to do so, that each $P \notin S^*$ has an essentially unique representation in the form of (10.18).

If $u$ is defined on the measures in $S^*$, its linear extension to all of $\mathcal{P}$ is defined from (10.18) thus:

$$u(P) = \Big[\sum_{j=1}^{m} \beta_j u(Q_j) - \sum_{i=2}^{n} \alpha_i u(P_i)\Big] \Big/ \alpha_1.$$

To establish Theorem 10.6 define $u(x) = 0$ for all $x \in X$ and let $u(P) = 1$ for every $P \in S^*$ that is not simple. Let $u$ on $S^*$ be extended linearly by (10.18) to all of $\mathcal{P}$ and define $P \prec Q \Leftrightarrow u(P) < u(Q)$. Then $H$ is seen to hold and A4c holds for the simple reason that $x \prec y$ for no $x, y \in X$. Hence the hypotheses of Theorem 10.6 hold. But (10.16) is clearly false. ♦

A variation on this example shows that A4c cannot be deleted from the hypotheses of Theorem 10.5. Take $u(x) = x$ for each $x \in \{1, 2, \ldots\}$ and $u(P) = 0$ for each nonsimple $P \in S^*$ and extend $u$ linearly by (10.18) to all of $\mathcal{P}$. Define $P \prec Q \Leftrightarrow u(P) < u(Q)$. With $\gamma_1 R + \sum_{k \ge 2} \gamma_k R_k = \sum_{k} \delta_k S_k$ along with $P$ as in (10.18) we get $u(\alpha P + (1 - \alpha)R) = \alpha u(P) + (1 - \alpha)u(R)$ so that (10.15) holds. Moreover, $x \prec y$ for some $x, y \in X$. But $u$ on $X$ is unbounded and therefore, by Lemma 10.5, A4c must be false. Clearly, (10.16) fails, for otherwise we could construct a $P$ with infinite expected utility.


10.6 PROOFS OF THEOREMS 10.1, 10.3, AND 10.5

For Theorem 10.1 let $H$ and A4a hold, let $u$ on $\mathcal{P}$ satisfy (10.14) and (10.15), and define $u$ on $X$ as in Lemma 10.5. $u$ on $X$ is bounded. We note first that

$$P(A) = 1 \Rightarrow \inf \{u(x) : x \in A\} \le u(P) \le \sup \{u(x) : x \in A\}. \tag{10.19}$$

Let $c = \inf$ and $d = \sup$ in (10.19). To the contrary of (10.19), suppose that $d < u(P)$. Then, for any $x \in A$, (10.15) implies that there is a convex combination $R = \alpha P + (1 - \alpha)x$ such that $d < u(R) < u(P)$. Therefore, by (10.14), $x \prec R$ for all $x \in A$ and hence $P \preceq R$ by A4a. But this contradicts $u(R) < u(P)$. Hence $d < u(P)$ is false. $u(P) < c$ is seen to be false on using the other half of A4a. Hence (10.19) holds.

Let $a = \inf \{u(x) : x \in X\}$ and $b = \sup \{u(x) : x \in X\}$, and let $A_{i,n}$ be defined by (10.9). For $P \in \mathcal{P}$ let $n^* = \{i : i \in \{1, \ldots, n\},\ P(A_{i,n}) > 0\}$. Then, by $H$ and (10.7), $P = \sum_{n^*} P_{A_{i,n}} P(A_{i,n})$, so that $u(P) = \sum_{n^*} u(P_{A_{i,n}}) P(A_{i,n})$ by (10.15). Hence, by (10.9) and (10.19),

$$\sum_{n^*} [a + (i - 1)(b - a)/n] P(A_{i,n}) \le u(P) \le \sum_{n^*} [a + i(b - a)/n] P(A_{i,n}). \tag{10.20}$$

Since $u$ is bounded and $\mathcal{A}$-measurable and $u_1, u_2, \ldots$ defined by (10.10) converges uniformly from below to $u$, Definition 10.12 gives

$$E(u, P) = \sup \Big\{\sum_{n^*} [a + (i - 1)(b - a)/n] P(A_{i,n}) : n = 1, 2, \ldots\Big\}.$$

Since the difference between the two sums in (10.20) equals $(b - a)/n$, which goes to 0 as $n$ gets large, $u(P) = E(u, P)$. ♦

Proof of Theorem 10.3. Let $H$ and A4b hold and assume that every $P \in \mathcal{P}$ is countably additive. With $u$ as in the preceding proof, we need to verify (10.19). Then $u(P) = E(u, P)$ follows from the second half of the proof of Theorem 10.1.

Let $P(A) = 1$, $c = \inf \{u(x) : x \in A\}$, and $d = \sup \{u(x) : x \in A\}$. If $\{u(x) : x \in A\} = \{c, d\}$, (10.19) follows from A4b and (10.14). Henceforth, assume that $c < u(w) < d$ for a fixed $w \in A$, and let

$$\begin{aligned}
A_w &= \{x : x \in A,\ x \prec w\}, & \mathcal{P}_w &= \{Q : Q \in \mathcal{P},\ Q(A_w) = 1\} \\
A^w &= \{x : x \in A,\ w \preceq x\}, & \mathcal{P}^w &= \{Q : Q \in \mathcal{P},\ Q(A^w) = 1\}.
\end{aligned} \tag{10.21}$$

$A = A_w \cup A^w$ with $A_w \ne \emptyset$ and $A^w \ne \emptyset$. Let $B = \{x : x \in X,\ x \prec w\}$ so that $B \in \mathcal{A}$ by S2. Then $A_w \in \mathcal{A}$ since $A_w = A \cap B = [A^c \cup B^c]^c$.




Similarly, $A^w \in \mathcal{A}$. Then, by (10.7) and S5, $P$ equals a convex combination of a measure in $\mathcal{P}_w$ and a measure in $\mathcal{P}^w$. It follows from (10.15) that (10.19) holds if it holds for every measure in $\mathcal{P}_w \cup \mathcal{P}^w$.

To verify that $Q \in \mathcal{P}^w \Rightarrow c \le u(Q) \le d$, we note first that

$$c \le u(Q) \quad \text{for every } Q \in \mathcal{P}^w \tag{10.22}$$

follows from $c < u(w)$, A4b, (10.14), and (10.21). It follows from (10.22) and an analysis like that used in the proof of Lemma 10.5 that $u$ on $\mathcal{P}^w$ is bounded above. Thus, let $M$ be such that

$$c \le u(Q) \le M \quad \text{for all } Q \in \mathcal{P}^w. \tag{10.23}$$

If $u(x) = d$ for some $x \in A^w$ then $u(Q) \le d$ for all $Q \in \mathcal{P}^w$ by A4b and (10.14) so that $c \le u(Q) \le d$ for this case. Alternatively, suppose that $u(x) < d$ for all $x \in A^w$ and with $\varepsilon > 0$ let

$$A(\varepsilon) = \{x : x \in A^w,\ u(x) < d - \varepsilon\}$$

$$B(\varepsilon) = \{x : x \in A^w,\ d - \varepsilon \le u(x)\}.$$

Then $A(\varepsilon) \cup B(\varepsilon) = A^w$ and $\{A(\varepsilon) : \varepsilon > 0\}$ is weak ordered by $\subset$ so that for any $Q \in \mathcal{P}^w$ it follows from (10.3) of Lemma 10.2 that

$$\sup \{Q(A(\varepsilon)) : \varepsilon > 0\} = 1. \tag{10.24}$$

If $Q(A(\varepsilon)) = 1$ for some $\varepsilon > 0$ then $u(Q) \le d$ by (10.14) and A4b. On the other hand, if $Q(A(\varepsilon)) < 1$ for all $\varepsilon > 0$ then, with $\varepsilon$ small and $Q_{A(\varepsilon)}$, $Q_{B(\varepsilon)}$ respectively the conditional measure of $Q$ given $A(\varepsilon)$, $B(\varepsilon)$, it follows from (10.15) and (10.7) that

$$u(Q) = Q(A(\varepsilon))u(Q_{A(\varepsilon)}) + Q(B(\varepsilon))u(Q_{B(\varepsilon)}).$$

Hence, by (10.23) and $Q_{A(\varepsilon)}(A(\varepsilon)) = 1$, $u(Q) \le Q(A(\varepsilon))d + [1 - Q(A(\varepsilon))]M$ for all small $\varepsilon > 0$. $u(Q) \le d$ then follows from (10.24). Hence $Q \in \mathcal{P}^w \Rightarrow c \le u(Q) \le d$. By a symmetric proof, $Q \in \mathcal{P}_w \Rightarrow c \le u(Q) \le d$. ♦

Proof of Theorem 10.5. Let the hypotheses of Theorem 10.5 hold. Since every $P$ is assumed to be discrete, $\mathcal{A}$ is a $\sigma$-algebra. With $a = \inf \{u(x) : x \in X\}$ and $b = \sup \{u(x) : x \in X\}$, $a < b$ since $x \prec y$ for some $x, y \in X$. We shall prove first that $u$ on $\mathcal{P}$ is bounded.

If $a < u(w) < b$ for some $w \in X$, boundedness of $u$ on $\mathcal{P}$ follows from an analysis like that using (10.22) in the preceding proof. Henceforth in this paragraph assume that $\{u(x) : x \in X\} = \{a, b\}$ and let

$$\mathcal{P}_a = \{P : P \in \mathcal{P},\ P(u(x) = a) = 1\}$$

$$\mathcal{P}_b = \{P : P \in \mathcal{P},\ P(u(x) = b) = 1\},$$

so that every $P \in \mathcal{P}$ is a convex combination of one measure from each of $\mathcal{P}_a$ and $\mathcal{P}_b$, and (10.15) says that $u$ on $\mathcal{P}$ is bounded if $u$ is bounded on $\mathcal{P}_a \cup \mathcal{P}_b$. For $\mathcal{P}_b$, an analysis like that using (10.22) applies: $a \le u(P)$ for all $P \in \mathcal{P}_b$ by A4c and (10.14). A symmetric analysis shows that $u$ on $\mathcal{P}_a$ is bounded.

If $P$ is simple, $u(P) = E(u, P)$ follows from (10.15). If $P$ is not simple and $A = \{x : P(x) > 0\}$, Lemma 10.3 gives $P(A) = 1$. With the elements in $A$ enumerated as $x_1, x_2, \ldots$, $P = \sum_{i=1}^{\infty} P(x_i)x_i$. Hence, by the finite extension of (10.15),

$$u(P) = \sum_{i=1}^{n} P(x_i)u(x_i) + \Big[\sum_{i=n+1}^{\infty} P(x_i)\Big] u\Big(\sum_{i=n+1}^{\infty} \Big[\sum_{j=n+1}^{\infty} P(x_j)\Big]^{-1} P(x_i)x_i\Big) \tag{10.25}$$

for $n = 1, 2, \ldots$. And by Exercise 20a,

$$E(u, P) = \sum_{i=1}^{n} P(x_i)E(u, x_i) + \Big[\sum_{i=n+1}^{\infty} P(x_i)\Big] E\Big(u, \sum_{i=n+1}^{\infty} \Big[\sum_{j=n+1}^{\infty} P(x_j)\Big]^{-1} P(x_i)x_i\Big) \tag{10.26}$$

for $n = 1, 2, \ldots$. Since $E(u, x_i) = u(x_i)$, it follows from (10.25) and (10.26) that

$$u(P) = E(u, P) + \Big[\sum_{i=n+1}^{\infty} P(x_i)\Big] \Big[u\Big(\sum_{i=n+1}^{\infty} \Big[\sum_{j=n+1}^{\infty} P(x_j)\Big]^{-1} P(x_i)x_i\Big) - E\Big(u, \sum_{i=n+1}^{\infty} \Big[\sum_{j=n+1}^{\infty} P(x_j)\Big]^{-1} P(x_i)x_i\Big)\Big]. \tag{10.27}$$

Since $u$ on $\mathcal{P}$ is bounded, since $E(u, P)$ on $\mathcal{P}$ is bounded when $u$ on $X$ is bounded, and since $\sum_{i=n+1}^{\infty} P(x_i)$ approaches 0 as $n$ gets large, the second term on the right of (10.27) approaches 0 as $n$ gets large and therefore must equal zero for all $n$. Hence $u(P) = E(u, P)$. ♦


10.7  SUMMARY 

The weak-order expected-utility result, $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$, holds for sets of probability measures that include nonsimple measures when appropriate dominance axioms are used. The basic idea of such axioms is that if a measure $P$ is preferred to every consequence in a set to which a measure $Q$ assigns probability 1, then $Q$ shall not be preferred to $P$; and if every consequence in a set to which $Q$ assigns probability 1 is preferred to $P$ then $P$ shall not be preferred to $Q$. This is condition A4a of Section 10.4. If all probability measures under consideration are countably additive then a "weaker" form of dominance axiom will yield the expected-utility result in conjunction with the preference conditions of Chapter 8 and several structural conditions on the set of measures and the Boolean algebra on which they are defined.




INDEX  TO  EXERCISES 

1. Denumerable sums. 2-3. Boolean algebras. 4. Countable unions. 5. $\sigma$-Algebra. 6-11. Infs and sups. 12-15. Countable additivity. 16. Uniform convergence from above. 17. Expectations of sums. 18-19. Expectations with convex combinations of measures. 20. Conditional expectations. 21. Expectations are sums. 22-24. Dominance and expectations. 25. S2. 26. Failure of A4a. 27-28. Proof of Theorem 10.6. 29. Blackwell-Girshick Theorem.

Exercises 

1. Prove that $\sum_{i=1}^{n} 2^{-i} = 1 - 2^{-n}$ by noting that $2(\sum_{i=1}^{n} 2^{-i}) - \sum_{i=1}^{n} 2^{-i} = 1 - 2^{-n}$. Also show that $0 < p < 1$ implies $\sum_{i=1}^{\infty} p(1 - p)^{i-1} = 1$.

2. Show that $\mathcal{A}$ is a Boolean algebra if and only if $\mathcal{A}$ is a nonempty set of subsets of $X$ satisfying (2) and (3) of Definition 10.1.

3. Let $\mathcal{A}$ be the set of all subsets of $\{1, 2, \ldots\}$ that are either finite or contain all but a finite number of positive integers. Show that $\mathcal{A}$ is the Boolean algebra generated by $\{\{1\}, \{2\}, \ldots\}$.

4. Specify $\bigcup_{i=1}^{\infty} A_i = \{x : x \in A_i \text{ for some } i\}$ when (a) $A_i = \emptyset$, (b) $A_i = \{-i, i\}$, (c) $A_i = (1/(1 + i), 1/i) \subseteq \mathrm{Re}$, (d) $A_i = [1/i, 2 - 1/i] \subseteq \mathrm{Re}$.

5. Describe the $\sigma$-algebra generated by $\{\{x\} : x \in \mathrm{Re}\}$.

6. Let $R$ be a bounded set of numbers. Prove:

a. $\sup R = -\inf \{r : -r \in R\}$,

b. $\sup \{\alpha r : r \in R\} = \alpha \sup R$ if $\alpha \ge 0$,

c. $\sup \{\alpha r : r \in R\} = \alpha \inf R$ if $\alpha \le 0$,

d. $\inf \{\alpha r : r \in R\} = \alpha \inf R$ if $\alpha \ge 0$,

e. $\inf \{\alpha r : r \in R\} = \alpha \sup R$ if $\alpha \le 0$.

7. With $R$ and $S$ bounded sets of numbers prove that $\sup \{r + s : r \in R,\ s \in S\} = \sup R + \sup S$. Then prove Lemma 10.1.

8. (Continuation.) Prove that $\sup \{\alpha_i + \beta_i : i = 1, 2, \ldots\} = \sup \{\alpha_i : i = 1, 2, \ldots\} + \sup \{\beta_i : i = 1, 2, \ldots\}$ if $\alpha_1, \alpha_2, \ldots$ and $\beta_1, \beta_2, \ldots$ are nondecreasing sequences of real numbers that are bounded above. Generalize this result to $n$ nondecreasing, bounded sequences.

9. (Continuation.) Suppose $\alpha_j \ge 0$ for all $j$, $\sum_{j=1}^{n} \alpha_j \le M$ for all positive integers $n$ and some number $M$, and for each $j$ ($j = 1, 2, \ldots$) $\beta_{1j}, \beta_{2j}, \ldots$ is a bounded nondecreasing sequence of nonnegative numbers. Using (10.1) prove that

$$\sup \Big\{\sum_{j=1}^{\infty} \alpha_j \beta_{ij} : i = 1, 2, \ldots\Big\} = \sum_{j=1}^{\infty} \alpha_j [\sup \{\beta_{ij} : i = 1, 2, \ldots\}].$$

10. (Continuation.) With the $\alpha_j$ as in the preceding exercise, suppose that, for each $j$, $\gamma_{1j}, \gamma_{2j}, \ldots$ is a nonincreasing sequence of nonnegative real numbers. Prove that

$$\inf \Big\{\sum_{j=1}^{\infty} \alpha_j \gamma_{ij} : i = 1, 2, \ldots\Big\} = \sum_{j=1}^{\infty} \alpha_j [\inf \{\gamma_{ij} : i = 1, 2, \ldots\}].$$

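11. Let $P_1, P_2, \ldots$ be a sequence of probability measures on the set of all subsets of $X$, let $\alpha_i \ge 0$ for all $i$ with $\sum_{i=1}^{\infty} \alpha_i = 1$, and for any probability measure $P$ and real-valued function $u$ on $X$ define

$$\lim_{\varepsilon \to 0} P(u(x) \ge r - \varepsilon) = \inf \{P(\{x : u(x) \ge r - \varepsilon_i\}) : i = 1, 2, \ldots\}$$

where $\varepsilon_1 > \varepsilon_2 > \cdots$ and $\inf \{\varepsilon_i : i = 1, 2, \ldots\} = 0$. Use the result of the preceding exercise to prove that, for any real number $r$,

$$\lim_{\varepsilon \to 0} \sum_{i=1}^{\infty} \alpha_i P_i(u(x) \ge r - \varepsilon) = \sum_{i=1}^{\infty} \alpha_i \Big[\lim_{\varepsilon \to 0} P_i(u(x) \ge r - \varepsilon)\Big].$$

12. Let $P$ be defined on $\mathcal{A}$ of Exercise 3 on the basis of $P(n) = 2^{-n}$ for $n = 1, 2, \ldots$. Prove that $P$ is countably additive.

13. Use the conclusion of Exercise 9 to prove ($P_i$ is a countably-additive probability measure on $\mathcal{A}$ for $i = 1, 2, \ldots$, $\alpha_i \ge 0$ for all $i$, and $\sum_{i=1}^{\infty} \alpha_i = 1$) $\Rightarrow$ $\sum_{i=1}^{\infty} \alpha_i P_i$ is a countably-additive probability measure on $\mathcal{A}$.

14. Prove that if $P$ on $\mathcal{A}$ is countably additive, if $\mathcal{B}$ is a countable subset of $\mathcal{A}$ weakly ordered by $\subset$, and if $\bigcap_{\mathcal{B}} A \in \mathcal{A}$, then $P(\bigcap_{\mathcal{B}} A) = \inf \{P(A) : A \in \mathcal{B}\}$.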

Note: In Exercises 15 through 24, $\mathcal{A}$ is a Boolean algebra on $X$; $f, g, \ldots$ are bounded and $\mathcal{A}$-measurable; $P, Q, \ldots$ are probability measures on $\mathcal{A}$.

15. If $P$ is countably additive show that $E(f, P)$ is unambiguously defined by (10.11) when $f_1, f_2, \ldots$ is a sequence of simple $\mathcal{A}$-measurable functions that satisfies conditions 1 and 2 of Definition 10.11.

16. A sequence $g_1, g_2, \ldots$ of simple $\mathcal{A}$-measurable functions converges uniformly from above to $g$ if and only if, for all $x \in X$,

1. $g_1(x) \ge g_2(x) \ge \cdots$

2. $g(x) = \inf \{g_i(x) : i = 1, 2, \ldots\}$

3. $\varepsilon > 0 \Rightarrow g_n(x) \le g(x) + \varepsilon$ for some $n$ (and all $x$).

Prove that $\sup \{E(f_n, P) : n = 1, 2, \ldots\} = \inf \{E(g_n, P) : n = 1, 2, \ldots\}$ when $f_1, f_2, \ldots$ ($g_1, g_2, \ldots$) converges uniformly from below (above) to $f$.

17. With $c$ a real number let $f + c$ be the function on $X$ that takes the value $f(x) + c$ at $x \in X$, let $cf$ be the function that takes the value $cf(x)$ at $x \in X$, and let $f + g$ have value $f(x) + g(x)$ at $x$. Prove

a. $E(f + c, P) = E(f, P) + c$,

b. $E(f + g, P) = E(f, P) + E(g, P)$,

c. $E(cf, P) = cE(f, P)$.

18. Show that if $\alpha \in [0, 1]$ then

$$E(f, \alpha P + (1 - \alpha)Q) = \alpha E(f, P) + (1 - \alpha)E(f, Q) \tag{10.28}$$

and then generalize this to $E(f, \sum_{i=1}^{n} \alpha_i P_i) = \sum_{i=1}^{n} \alpha_i E(f, P_i)$.




19. (Continuation.) Supposing that $\alpha_i \ge 0$ for $i = 1, 2, \ldots$, $\sum_{i=1}^{\infty} \alpha_i$ is finite, and $b_1, b_2, \ldots$ is a bounded sequence of numbers, define $\sum_{i=1}^{\infty} \alpha_i b_i$ from (10.1) by $\sum_{i=1}^{\infty} \alpha_i b_i = \sum_{i=1}^{\infty} \alpha_i (b_i + c) - c \sum_{i=1}^{\infty} \alpha_i$ where $c$ is such that $b_i + c \ge 0$ for all $i$. Show that $\sum_{i=1}^{\infty} \alpha_i b_i$ is well defined. Then use this definition along with Exercise 17a and Exercise 9 to prove that if $\alpha_i \ge 0$ for all $i$ and $\sum_{i=1}^{\infty} \alpha_i = 1$ then

$$E\Big(f, \sum_{i=1}^{\infty} \alpha_i P_i\Big) = \sum_{i=1}^{\infty} \alpha_i E(f, P_i). \tag{10.29}$$

20. Use the results of the two preceding exercises along with (10.7) and the sentence following its derivation to show that, given $A \in \mathcal{A}$,

a. [$P(A) = 1$, $\{A_1, \ldots, A_n\}$ is an $\mathcal{A}$-partition of $A$, $I = \{i : P(A_i) > 0\}$] $\Rightarrow$ $E(f, P) = \sum_{I} P(A_i)E(f, P_{A_i})$,

b. [$P(A) = 1$, $\{A_1, A_2, \ldots\}$ is a denumerable partition of $A$ with $A_i \in \mathcal{A}$ for all $i$, $I = \{i : P(A_i) > 0\}$, $P$ is countably additive] $\Rightarrow$ $E(f, P) = \sum_{I} P(A_i)E(f, P_{A_i})$.

21. Suppose all $x_i$ are different in each of b and c. Show that

a. $P(x) = 1 \Rightarrow E(f, P) = f(x)$.

b. $\sum_{i=1}^{n} P(x_i) = 1 \Rightarrow E(f, P) = \sum_{i=1}^{n} P(x_i)f(x_i)$.

c. $\sum_{i=1}^{\infty} P(x_i) = 1 \Rightarrow E(f, P) = \sum_{i=1}^{\infty} P(x_i)f(x_i)$.

22. Assume that $A \in \mathcal{A}$ and $P(A) = 1$. Prove that

a. [$f(x) \le g(x)$ for all $x \in A$] $\Rightarrow$ $E(f, P) \le E(g, P)$,

b. [$f(x) \le g(x)$ for all $x \in A$, $P(f(x) + \varepsilon \le g(x)) > 0$ for some $\varepsilon > 0$] $\Rightarrow$ $E(f, P) < E(g, P)$,

c. [$f(x) < g(x)$ for all $x \in A$, $P$ countably additive] $\Rightarrow$ $E(f, P) < E(g, P)$.

23. (Continuation.) Give an example where $P(f(x) < g(x)) = 1$ and not $E(f, P) < E(g, P)$. $P(f(x) < g(x)) = P(\{x : f(x) < g(x)\})$.

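24. With $u$ satisfying (10.16) let $A_y = \{x : x \in X,\ x \prec y\}$. Prove that [$P(A_y) \le Q(A_y)$ for all $y \in X$] $\Rightarrow$ $E(u, Q) \le E(u, P)$. Prove also that [$Q$ -A $P_i$ for $i = 1, 2, \ldots, n$, $\alpha_i \ge 0$ for all $i$, $\sum_{i=1}^{n} \alpha_i = 1$, $\sum_{i=1}^{n} \alpha_i P_i(A_y) \le Q(A_y)$ for all $y \in X$] $\Rightarrow$ $E(u, Q) \le E(u, P_i)$ for some $i$.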

25. Show that Definition 10.1, S2, and $\prec$ on $X$ connected imply $\{x : x \in X,\ y \prec x \preceq z\} \in \mathcal{A}$ and $\{x : x \in X,\ y \prec x \prec z\} \in \mathcal{A}$.

26. Give specific examples of probability measures that demonstrate the failure of A4a in the proofs of Theorems 10.2 and 10.4.

27. Use Zorn's Lemma to prove that $\mathcal{S}$ in the proof of Theorem 10.6 has a maximal element $S^*$.

28. Verify that the representation (10.18) for $\{P : P \in \mathcal{P},\ P \notin S^*\}$ in terms of measures in $S^*$ is essentially unique.

29. Blackwell and Girshick (1954). Prove that if $\mathcal{P}$ is the set of all discrete probability measures on the set of all subsets of $X$ and if A1 and A3 of Section 10.4 hold along with [$P_i, Q_i \in \mathcal{P}$, $P_i \preceq Q_i$, and $\alpha_i \ge 0$ for $i = 1, 2, \ldots$; $\sum_{i=1}^{\infty} \alpha_i = 1$; $P_i \prec Q_i$ for some $i$ for which $\alpha_i > 0$] $\Rightarrow$ $\sum_{i=1}^{\infty} \alpha_i P_i \prec \sum_{i=1}^{\infty} \alpha_i Q_i$, then there is a real-valued function $u$ on $X$ that satisfies (10.16).


Chapter  11 

ADDITIVE  EXPECTED  UTILITY 


This chapter combines the weak-order expected utility theory of Chapters 8 and 10 with the situation where the consequences in $X$ are $n$-tuples as in Chapters 4, 5, and 7. The main focus of the chapter is conditions that, when $X \subseteq X_1 \times X_2 \times \cdots \times X_n$, imply the existence of real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ such that

$$P \prec Q \Leftrightarrow \sum_{i=1}^{n} E(u_i, P_i) < \sum_{i=1}^{n} E(u_i, Q_i), \quad \text{for all } P, Q \in \mathcal{P}, \tag{11.1}$$

where $\mathcal{P}$ is a set of probability measures on $X$ and, for $P \in \mathcal{P}$, $P_i$ is the marginal measure of $P$ on $X_i$.

We shall examine (11.1) first for the case where $X = X_1 \times \cdots \times X_n$ and then for the more general case where $X \subseteq X_1 \times \cdots \times X_n$. Section 11.3 then examines the case where $X = X_1 \times \cdots \times X_n$ and (11.1) may fail but some form of additive interdependence applies such as $u(x_1, x_2, x_3) = u_1(x_1, x_2) + u_2(x_2, x_3)$. Finally, Section 11.4 looks at the homogeneous product set situation where $X = A^n$, as in Chapter 7.

11.1 ADDITIVE EXPECTED UTILITY WITH $X = \prod X_i$

To simplify our examination of independence among factors in a multidimensional consequence set in the expected-utility context, this chapter assumes that probability measures for $X$ are defined on the set of all subsets of $X$. A similar assumption applies for a measure defined for a factor set $X_i$.

Definition 11.1. Suppose $P$ is a probability measure on $X \subseteq \prod_{i=1}^{n} X_i$. Then $P_i$, the marginal measure of $P$ on $X_i$, is defined by

$$P_i(A_i) = P(\{x : x \in X,\ x_i \in A_i\}) \quad \text{for all } A_i \subseteq X_i. \tag{11.2}$$

In (11.2) $x_i$ is the $i$th component of $x$. When $X = \prod_{i=1}^{n} X_i$, (11.2) becomes $P_i(A_i) = P(X_1 \times \cdots \times X_{i-1} \times A_i \times X_{i+1} \times \cdots \times X_n)$. It is easily verified that $P_i$ is a probability measure on $X_i$ when $P$ is a probability measure on $X$.

It is possible to have $(P_1, \ldots, P_n) = (Q_1, \ldots, Q_n)$ when $P \ne Q$. With $n = 2$ let $P$ and $Q$ be the simple even-chance gambles

$$P(\$5000, \$5000) = P(\$100000, \$100000) = .5$$

$$Q(\$5000, \$100000) = Q(\$100000, \$5000) = .5.$$

Then $(P_1, P_2) = (Q_1, Q_2)$ although $P \ne Q$. $P$ gives an even chance for a two-year income stream of either (\$5000, \$5000) or (\$100000, \$100000). $Q$ gives an even chance for an income stream of either (\$5000, \$100000) or (\$100000, \$5000). I suspect that many people would prefer $Q$ to $P$. The condition for (11.1) requires that $P \sim Q$. This condition may seem more reasonable when the different factors in $X$ are heterogeneous.
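
The equality of marginals in this example can be verified mechanically. The following Python sketch represents a simple measure as a dictionary from consequence tuples to probabilities and computes marginals as in (11.2); the dictionary representation and the helper name `marginal` are assumptions made for illustration.

```python
from collections import defaultdict
from fractions import Fraction


def marginal(P, i):
    """Marginal of a simple measure P (dict of tuples -> prob) on factor i."""
    m = defaultdict(Fraction)
    for x, p in P.items():
        m[x[i]] += p
    return dict(m)


half = Fraction(1, 2)
P = {(5000, 5000): half, (100000, 100000): half}
Q = {(5000, 100000): half, (100000, 5000): half}

for i in range(2):
    print(i + 1, marginal(P, i), marginal(Q, i))
```

Both gambles give each year an even chance of \$5000 or \$100000, so any representation of the form (11.1) must assign them the same value even though the joint distributions differ.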

THEOREM 11.1. Suppose that $\mathcal{P}$ is either the set of simple probability
measures on $X = \prod_{i=1}^n X_i$ or a set of probability measures on $X = \prod_{i=1}^n X_i$
that satisfies S1 through S5 of Section 10.4, and suppose further that there is a
real-valued function $u$ on $X$ such that, for all $P, Q \in \mathcal{P}$, $P \prec Q \Leftrightarrow E(u, P) <
E(u, Q)$. Then there are real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$
respectively that satisfy (11.1) and are unique up to similar positive linear
transformations, if and only if $P \sim Q$ whenever $P$ and $Q$ are simple measures
in $\mathcal{P}$ such that $(P_1, \ldots, P_n) = (Q_1, \ldots, Q_n)$ and $P(x), Q(x) \in \{0, \tfrac{1}{2}\}$ for all
$x \in X$.

The very last condition here shows that (11.1) can be established on the
basis of simple 50-50 gambles when $X = \prod_{i=1}^n X_i$. The necessity of the
indifference condition for (11.1) is obvious. The sufficiency proof follows.

Proof. Fix $x^0 = (x_1^0, \ldots, x_n^0)$ in $X$, assign $u_1(x_1^0), \ldots, u_n(x_n^0)$ values that
sum to $u(x^0)$, and define $u_i$ on $X_i$ by

$$u_i(x_i) = u(x_1^0, \ldots, x_{i-1}^0, x_i, x_{i+1}^0, \ldots, x_n^0) - \sum_{j \neq i} u_j(x_j^0). \qquad (11.3)$$

The indifference condition between 50-50 gambles when $(P_1, \ldots, P_n) =
(Q_1, \ldots, Q_n)$ leads directly to $u(x_1, \ldots, x_i, x_{i+1}^0, \ldots, x_n^0) + u(x_1^0, \ldots, x_i^0,
x_{i+1}, x_{i+2}^0, \ldots, x_n^0) = u(x_1, \ldots, x_{i+1}, x_{i+2}^0, \ldots, x_n^0) + u(x^0)$ for $i = 1, \ldots,
n - 1$. Summing this from $i = 1$ to $i = n - 1$, cancelling identical terms, and
transposing $(n - 1)u(x^0)$ we get $\sum_{i=1}^n u(x_1^0, \ldots, x_{i-1}^0, x_i, x_{i+1}^0, \ldots, x_n^0) -
(n - 1)u(x_1^0, \ldots, x_n^0) = u(x_1, \ldots, x_n)$, which on comparison with (11.3)
shows that

$$u(x_1, \ldots, x_n) = \sum_{i=1}^n u_i(x_i) \quad \text{for all } (x_1, \ldots, x_n) \in X. \qquad (11.4)$$
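The construction in the proof can be tried on a small finite product set. The sketch below is illustrative only: the factor sets, the utility tables `g`, and the choice of putting all of $u(x^0)$ on the first factor are assumptions of ours. It builds each $u_i$ from $u$ by varying one coordinate away from the reference point $x^0$ and then tests whether the factor utilities sum back to $u$.

```python
from itertools import product


def additive_parts(u, x0):
    """Build u_i from u as in (11.3), with u_1(x0_1) = u(x0) and u_j(x0_j) = 0 otherwise."""
    n = len(x0)
    base = [u(x0)] + [0.0] * (n - 1)  # values at x0 that sum to u(x0)

    def u_i(i, xi):
        y = list(x0)
        y[i] = xi  # change only the i-th coordinate of the reference point
        return u(tuple(y)) - sum(base[j] for j in range(n) if j != i)

    return u_i


def satisfies_11_4(u, factors, x0):
    """Check u(x) = sum_i u_i(x_i) on the whole product set."""
    u_i = additive_parts(u, x0)
    n = len(x0)
    return all(
        abs(u(x) - sum(u_i(i, x[i]) for i in range(n))) < 1e-9
        for x in product(*factors)
    )


# Hypothetical factor sets and utilities, chosen only for illustration.
factors = [["a", "b"], [0, 1, 2], ["lo", "hi"]]
g = [{"a": 0.0, "b": 2.5}, {0: 1.0, 1: -1.0, 2: 4.0}, {"lo": 0.5, "hi": 3.0}]
x0 = ("a", 0, "lo")


def u_add(x):
    return sum(g[i][x[i]] for i in range(3))


def u_int(x):
    # Same as u_add plus an interaction between the first and third factors.
    return u_add(x) + (1.0 if x[0] == "b" and x[2] == "hi" else 0.0)


print(satisfies_11_4(u_add, factors, x0), satisfies_11_4(u_int, factors, x0))
```

The interaction term in `u_int` never shows up when only one coordinate is moved away from $x^0$, which is why the reconstructed sum misses it.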




If $\mathcal{P}$ satisfies the Section 10.4 conditions then $u$ on $X$ is bounded and hence
$u_i$ on $X_i$, defined by (11.3), is bounded. In any event, although $u_i$ is defined
on $X_i$ it is equivalent to a function $u_i^*$ defined on $X$ by $u_i^*(x) = u_i(x_i)$. Then,
using (11.4) and Exercise 10.17b,

$$E(u, P) = E(u_1^* + \cdots + u_n^*, P)$$

$$= E(u_1^*, P) + \cdots + E(u_n^*, P)$$

$$= E(u_1, P_1) + \cdots + E(u_n, P_n)$$

which yields (11.1) in conjunction with $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$.

Finally, suppose that $v_1, \ldots, v_n$ on $X_1, \ldots, X_n$ satisfy (11.1) along with
$u_1, \ldots, u_n$. Define $u$ and $v$ on the simple measures in $\mathcal{P}$ by $u(P) = \sum_i E(u_i, P_i)$
and $v(P) = \sum_i E(v_i, P_i)$. It is easily seen that $u(\alpha P + (1 - \alpha)Q) = \alpha u(P) +
(1 - \alpha)u(Q)$ and similarly for $v$ for simple measures $P, Q \in \mathcal{P}$. Hence, by
Theorem 8.4, $v$ is a positive linear transformation of $u$, say $v = au + b$,
$a > 0$. We then have $\sum_i v_i(x_i) = \sum_i E(v_i, x_i) = v(x_1, \ldots, x_n) = au(x_1, \ldots,
x_n) + b = a \sum_i u_i(x_i) + b$, from which it follows that $v_i(x_i) = au_i(x_i) +
[b + a \sum_{j \neq i} u_j(x_j^0) - \sum_{j \neq i} v_j(x_j^0)] = au_i(x_i) + b_i$, for each $i$, where $b_i$ is
defined in context. ♦

11.2  ADDITIVE  EXPECTATIONS  WITH  $X \subseteq \prod X_i$

When $X \subset \prod_{i=1}^n X_i$, (11.1) does not generally follow from the 50-50
gambles version of the indifference condition. In general, we require the more
general condition that $(P_1, \ldots, P_n) = (Q_1, \ldots, Q_n) \Rightarrow P \sim Q$. When $X$ is
finite, (11.1) follows from this and $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$ as is noted
in Exercise 4. For $X$ infinite, only the $n = 2$ case has been satisfactorily
worked out and then only for simple probability measures. Therefore, this
section examines only the $X \subseteq X_1 \times X_2$ case.

To show one difficulty that may arise in this case for nonsimple probability
measures suppose $X = \{(0, 0), (1, 0), (1, 1), (2, 1), (2, 2), (3, 2), \ldots\}$ and let
$u$ on $X$, satisfying $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$ for all discrete measures $P$
and $Q$ on $X$, be such that $u(k, k) = 0$ for $k = 0, 1, 2, \ldots$ and $u(k + 1, k) =
1$ for $k = 0, 1, 2, \ldots$. Set $u_1(0) = u_2(0) = 0$ for (11.1). Then for (11.1) to
hold for all one-point probability measures we must have $u_1(k) = k$ and
$u_2(k) = -k$ for $k = 0, 1, 2, \ldots$ [when $u(x_1, x_2) = u_1(x_1) + u_2(x_2)$]. Define $P$
by

$$P(2^k, 2^k) = 2^{-k} \quad \text{for } k = 1, 2, \ldots$$

so that $E(u, P) = 0$. Then $E(u_1, P_1)$, if defined at all, is infinite: $E(u_1, P_1) =
2^1 \cdot 2^{-1} + 2^2 \cdot 2^{-2} + \cdots = 1 + 1 + \cdots = +\infty$. Likewise $E(u_2, P_2) = -2^1 \cdot
2^{-1} - 2^2 \cdot 2^{-2} - \cdots = -\infty$, so that $E(u_1, P_1) + E(u_2, P_2)$ is not meaningfully
defined. Despite this, $E(u, Q) = E(u_1, Q_1) + E(u_2, Q_2)$ when $Q$ is
simple.
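The divergence in this example is easy to see from partial sums. A small sketch, using exact rational arithmetic and truncating the measure at the first $K$ points (the truncation and the function name are ours):

```python
from fractions import Fraction


def partial_expectations(K):
    """Partial sums over the points (2^k, 2^k), k = 1..K, each with probability 2^-k."""
    e1 = e2 = e = Fraction(0)
    for k in range(1, K + 1):
        point, prob = 2 ** k, Fraction(1, 2 ** k)
        e1 += point * prob   # u1(2^k) = 2^k, so each term contributes +1
        e2 += -point * prob  # u2(2^k) = -2^k, so each term contributes -1
        e += 0 * prob        # u(2^k, 2^k) = 0
    return e1, e2, e


for K in (1, 10, 100):
    print(K, partial_expectations(K))
```

The first two partial sums grow like $K$ and $-K$ while the joint expectation stays at zero, so the two marginal expectations cannot be added in the limit.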




The Structure of $X \subseteq X_1 \times X_2$

Several special definitions that apply to this section only will be used in
resolving the $X \subseteq X_1 \times X_2$ case for simple probability measures.

Definition 11.2. $(x_1, x_2)R(y_1, y_2)$ if and only if there is a finite sequence
$(x_1, x_2), x^1, \ldots, x^N, (y_1, y_2)$ of elements in $X \subseteq X_1 \times X_2$ such that any two
adjacent elements have at least one component in common.

$R$ is easily seen to be an equivalence on $X$. We shall let $\mathcal{D}$ be the set of
equivalence classes of $X$ under $R$. In the preceding example $\mathcal{D} = \{X\}$. If
$D, D^* \in \mathcal{D}$ and $(x_1, x_2) \in D$, $(y_1, y_2) \in D^*$, and $D \neq D^*$ then it must be
true that $x_1 \neq y_1$ and $x_2 \neq y_2$. Hence, given $u$ on $X$, there will be a $u_1, u_2$
solution to

$$u(x_1, x_2) = u_1(x_1) + u_2(x_2) \quad \text{for all } (x_1, x_2) \in X \qquad (11.5)$$

as required for (11.1) if and only if there is a $u_1, u_2$ solution for each $D \in \mathcal{D}$
considered separately. We therefore concentrate on an arbitrary $D \in \mathcal{D}$.
Proofs of our first three lemmas are left to the reader.
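For a finite $X$ the equivalence classes under $R$ can be computed directly. The sketch below is one possible approach, not a construction from the text: it treats each first-component value and each second-component value as a node, joins the two nodes of every element of $X$ with a union-find structure, and groups elements by the resulting root. The function name is hypothetical.

```python
def equivalence_classes(X):
    """Partition a finite X (a set of pairs) into classes of the relation R."""
    parent = {}

    def find(a):
        parent.setdefault(a, a)
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    def union(a, b):
        parent[find(a)] = find(b)

    # Tag values with 1 or 2 so equal values in different factors stay distinct.
    for x1, x2 in X:
        union((1, x1), (2, x2))

    classes = {}
    for x in X:
        classes.setdefault(find((1, x[0])), set()).add(x)
    return list(classes.values())


# A finite piece of the staircase example forms a single class.
staircase = {(0, 0), (1, 0), (1, 1), (2, 1), (2, 2), (3, 2)}
# Here (5, 7) shares no component with the other two elements.
split = {(0, 0), (0, 1), (5, 7)}

print(len(equivalence_classes(staircase)), len(equivalence_classes(split)))
```

Two elements land in the same class exactly when a chain of elements links them with adjacent elements sharing a component, which is the relation of Definition 11.2.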

Definition  11.3.  An  alternating  sequence  in  D  is  a  finite  sequence  of  two 
or  more  distinct  elements  in  D  such  that 

1.  any  two  adjacent  elements  have  one  component  in  common, 

2.  no  three  consecutive  elements  in  the  sequence  have  the  same  first 
component  or  the  same  second  component. 

LEMMA 11.1. If $x, y \in D$ and $x \neq y$ then there is an alternating sequence in
$D$ that begins with $x$ and ends with $y$.

Definition  11.4.  A  cycle  in  D  is  a  subset  of  an  even  number  of  elements  in 
D  that  can  be  positioned  in  an  alternating  sequence  whose  first  and  last 
elements  have  the  same  first  component  if  the  first  and  second  elements  have 
the  same  second  component  or  whose  first  and  last  elements  have  the  same 
second  component  if  the  first  and  second  elements  have  the  same  first 
component. 

LEMMA 11.2. If $D$ has no cycles then there is exactly one alternating
sequence in $D$ from $x$ to $y$ when $x, y \in D$ and $x \neq y$.

LEMMA 11.3. Suppose $x^1, \ldots, x^{2n}$ is an alternating sequence in $D$ whose
elements form a cycle. Suppose further that $\mathcal{P}$ is a mixture set of probability
measures that includes the simple measures, that there is a real-valued function
$u$ on $\mathcal{P}$ that satisfies (8.5) and (8.6), and that $[P, Q \in \mathcal{P}, (P_1, P_2) = (Q_1, Q_2)] \Rightarrow
P \sim Q$. Then

$$\sum_{i=1}^n u(x_1^{2i-1}, x_2^{2i-1}) = \sum_{i=1}^n u(x_1^{2i}, x_2^{2i}). \qquad (11.6)$$
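A quick numerical check shows why an equality of this kind is to be expected when $u$ is additive: around a cycle every first component and every second component occurs once at an odd position and once at an even position. The six-element cycle and the utility values below are hypothetical, chosen only to illustrate the bookkeeping.

```python
# A six-element cycle written as an alternating sequence: consecutive elements
# alternately share the first and the second component, and the last element
# shares its second component with the first element.
cycle = [("x1", "x2"), ("x1", "z2"), ("y1", "z2"),
         ("y1", "y2"), ("z1", "y2"), ("z1", "x2")]

# Hypothetical factor utilities.
u1 = {"x1": 1.0, "y1": -2.0, "z1": 0.5}
u2 = {"x2": 3.0, "y2": 0.0, "z2": 7.0}


def u(p):
    return u1[p[0]] + u2[p[1]]


odd_sum = sum(u(p) for p in cycle[0::2])   # positions 1, 3, 5
even_sum = sum(u(p) for p in cycle[1::2])  # positions 2, 4, 6
print(odd_sum, even_sum)
```

This only illustrates the necessity direction. The lemma itself derives the equality from the indifference condition without assuming additivity.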

LEMMA 11.4. If $D \in \mathcal{D}$ then there is a $C \subseteq D$ such that

1. $C$ includes no cycle,

2. $xRy$ for each $x, y \in C$ with $xRy$ established by a sequence all of whose
elements are in $C$,

3. $(x_1, x_2) \in D \Rightarrow (x_1, y_2) \in C$ and $(y_1, x_2) \in C$ for some $y_1 \in X_1$, $y_2 \in X_2$.

Proof. With $D \in \mathcal{D}$ let

$$\mathcal{C} = \{C : C \subseteq D,\ C \text{ satisfies conditions 1 and 2 of Lemma 11.4}\}.$$

We shall prove that $\mathcal{C}$ has a maximal element that satisfies condition 3. Let
$\mathcal{C}^*$ be a subset of $\mathcal{C}$ that is strictly ordered by $\subset$, and let $C^* = \bigcup_{C \in \mathcal{C}^*} C$.
$C^* \in \mathcal{C}$ since $C^* \subseteq D$ and

1. $C^*$ includes no cycle, for if $\{x^1, \ldots, x^n\} \subseteq C^*$ is a cycle then with
$x^i \in C_i$, $C_i \in \mathcal{C}^*$, the largest of these $C_i$ will include $\{x^1, \ldots, x^n\}$ and this
contradicts $C_i \in \mathcal{C}$;

2. $xRy$ if $x, y \in C^*$, for $x, y \in C$ for some $C \in \mathcal{C}^*$.

Thus, by Zorn's Lemma, there is a $B \in \mathcal{C}$ such that $B \subset C$ for no $C \in \mathcal{C}$.
Suppose $(x_1, x_2) \in D$ and $(x_1, x_2) \notin B$. Then, since $B$ is maximal,
$B \cup \{(x_1, x_2)\}$ must include a cycle which, since $B$ includes no cycles,
contains $(x_1, x_2)$. It follows from Definitions 11.3 and 11.4 that
$(x_1, y_2) \in B$ and $(y_1, x_2) \in B$ for some $y_2 \in X_2$ and $y_1 \in X_1$. ♦

Additive  Expected  Utility  with  Simple  Measures 

The appropriate theorem for (11.1) with $X \subseteq X_1 \times X_2$ and simple
probability measures follows.

THEOREM 11.2. Suppose $\mathcal{P}$ is the set of simple probability measures on
$X \subseteq X_1 \times X_2$ and there is a real-valued function $u$ on $X$ such that $P \prec Q \Leftrightarrow
E(u, P) < E(u, Q)$, for all $P, Q \in \mathcal{P}$. Then there are real-valued functions $u_1$
on $X_1$ and $u_2$ on $X_2$ that satisfy (11.1) if and only if $[P, Q \in \mathcal{P}, (P_1, P_2) =
(Q_1, Q_2)] \Rightarrow P \sim Q$. If $v_1$ on $X_1$ and $v_2$ on $X_2$ satisfy (11.1) along with $u_1$ and
$u_2$ then there are numbers $a > 0$ and $b$ and real-valued functions $f_1$ and $f_2$ on
$\mathcal{D}$ such that

$$v_1(x_1) = au_1(x_1) + f_1(D(x_1)) \quad \text{for all } x_1 \in X_1$$

$$v_2(x_2) = au_2(x_2) + f_2(D(x_2)) \quad \text{for all } x_2 \in X_2$$

$$f_1(D) + f_2(D) = b \quad \text{for all } D \in \mathcal{D}$$

where $D(x_i) \in \mathcal{D}$ contains an element whose $i$th component is $x_i$.




For completeness we should mention that each $x_i \in X_i$ is assumed to be the
$i$th component of some $x \in X$, for $i = 1, 2$.

Proof. The sufficiency of the hypotheses for (11.1) is proved by showing
that (11.5) holds. Two cases are considered for any $D \in \mathcal{D}$.

Case 1: $D$ has no cycles. Fix $x^0 \in D$, define $u_1(x_1^0)$ and $u_2(x_2^0)$ so that
$u_1(x_1^0) + u_2(x_2^0) = u(x^0)$, and proceed term by term along alternating
sequences beginning at $x^0$, defining $u_1$ and $u_2$ in the only way possible to satisfy
(11.5). By Lemma 11.1, every $x_1$ and $x_2$ in elements in $D$ has a $u_1(x_1)$ or
$u_2(x_2)$ thus defined. Lemma 11.2 implies that the $u_1$ and $u_2$ values are unique,
given $u_1(x_1^0)$ and $u_2(x_2^0)$.
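The term-by-term procedure of Case 1 can be sketched for a finite class $D$. The code below is an illustration under our own assumptions: $u$ is given as a dictionary on one equivalence class, the search order is breadth-first rather than along explicit alternating sequences, and the final flag simply reports whether (11.5) holds everywhere, which covers classes with cycles as well.

```python
from collections import deque


def solve_additive(u, x0, u1_at_x0=0.0):
    """Propagate u1, u2 from x0 over one equivalence class; u maps (x1, x2) to a value."""
    u1 = {x0[0]: u1_at_x0}
    u2 = {x0[1]: u[x0] - u1_at_x0}
    by_first, by_second = {}, {}
    for p in u:
        by_first.setdefault(p[0], []).append(p)
        by_second.setdefault(p[1], []).append(p)

    seen, queue = {x0}, deque([x0])
    while queue:
        p = queue.popleft()
        # Neighbours share a component with p, so one of their values is known.
        for q in by_first[p[0]] + by_second[p[1]]:
            if q in seen:
                continue
            seen.add(q)
            if q[0] in u1 and q[1] not in u2:
                u2[q[1]] = u[q] - u1[q[0]]  # forced by (11.5)
            elif q[1] in u2 and q[0] not in u1:
                u1[q[0]] = u[q] - u2[q[1]]  # forced by (11.5)
            queue.append(q)

    consistent = all(abs(u[p] - u1[p[0]] - u2[p[1]]) < 1e-12 for p in u)
    return u1, u2, consistent


# Finite piece of the staircase example: u(k, k) = 0 and u(k + 1, k) = 1.
u = {(0, 0): 0.0, (1, 0): 1.0, (1, 1): 0.0, (2, 1): 1.0, (2, 2): 0.0, (3, 2): 1.0}
u1, u2, ok = solve_additive(u, (0, 0))
print(u1, u2, ok)
```

On the staircase the propagation recovers $u_1(k) = k$ and $u_2(k) = -k$, the values the text derives for that example.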

Case 2: $D$ has cycles. Let $C \subseteq D$ satisfy the three conditions of Lemma
11.4. The Case 1 proof then applies to $C$ and gives $u(x_1, x_2) = u_1(x_1) + u_2(x_2)$
for all $x \in C$. Suppose $x \in D$, $x \notin C$. Then, by condition 3 of Lemma 11.4,
we have $(x_1, y_2) \in C$ and $(y_1, x_2) \in C$ and, by Lemmas 11.1 and 11.2 there is a
unique alternating sequence in $C$ from $(x_1, y_2)$ to $(y_1, x_2)$. Hence $C \cup \{x\}$ has
a cycle that must include $x$. An alternating sequence whose elements form
such a cycle can be written as $(x_1, x_2), (x_1^2, x_2^2), \ldots, (x_1^{2n}, x_2^{2n})$. By Lemma
11.3, (11.6) holds with $(x_1^1, x_2^1)$ as $(x_1, x_2)$. Applying $u = u_1 + u_2$ to the $C$
terms in the cycle it follows from (11.6) after cancellation that $u(x_1, x_2) =
u_1(x_1) + u_2(x_2)$. It follows that $u = u_1 + u_2$ holds on all of $D$.

For the last part of the theorem let $u_1, u_2$ and $v_1, v_2$ each satisfy (11.1).
Using the approach in the final paragraph of the proof of Theorem 11.1 we
get $v_1(x_1) + v_2(x_2) = au_1(x_1) + au_2(x_2) + b$ for all $x \in X$. For a given $D$ let
$x^0 \in D$. The Case 1 procedure for assigning $u_1$ and $u_2$ then leads to

$$v_1(x_1) = au_1(x_1) + au_2(x_2^0) + b - v_2(x_2^0)$$

$$v_2(x_2) = au_2(x_2) + au_1(x_1^0) + b - v_1(x_1^0)$$

for all $x \in D$. Letting $f_1(D) = au_2(x_2^0) + b - v_2(x_2^0)$ and $f_2(D) = au_1(x_1^0) +
b - v_1(x_1^0)$, the desired equations follow. (A different $x^0$ must be chosen for
each $D$ since there are no $x_1$ or $x_2$ interconnections between different elements
in $\mathcal{D}$.) ♦

11.3  ADDITIVE,  INTERDEPENDENT  EXPECTATIONS  FOR  $\prod X_i$

Throughout this section we take $X = \prod_{i=1}^n X_i$ and let $\{I_1, \ldots, I_m\}$ be an
arbitrary, but fixed, nonempty set of nonempty subsets of $\{1, 2, \ldots, n\}$.
For Section 11.1, $I_j = \{j\}$ for $j = 1, \ldots, n$. Here we shall permit the $I_j$ to
overlap.

We shall let $\mathcal{P}$ be a set of probability measures on $X$ and let $\mathcal{P}^j$ be a set of
probability measures on $\prod_{I_j} X_i$. With $P \in \mathcal{P}$, the marginal measure $P_j$ of
$P$ on $\prod_{I_j} X_i$ is such that $P_j(A^j) = P(\{x : x \in X,\ x_i \in X_i \text{ for } i \notin I_j,\ x^j \in A^j\})$ for
every $A^j \subseteq \prod_{I_j} X_i$, where $x^j$ is the projection of $x$ onto $I_j$. For example, if
$x = (x_1, x_2, x_3, x_4)$ and $I_j = \{1, 3\}$ then $x^j = (x_1, x_3)$.


THEOREM 11.3. Suppose that the hypotheses in the first sentence of
Theorem 11.1 hold. Then there are real-valued functions $u_1, \ldots, u_m$ on
$\prod_{I_1} X_i, \ldots, \prod_{I_m} X_i$ respectively such that

$$P \prec Q \Leftrightarrow \sum_{j=1}^m E(u_j, P_j) < \sum_{j=1}^m E(u_j, Q_j), \quad \text{for all } P, Q \in \mathcal{P}, \qquad (11.7)$$

if and only if $[P, Q \in \mathcal{P}, (P_1, \ldots, P_m) = (Q_1, \ldots, Q_m)] \Rightarrow P \sim Q$.

Admissible transformations for the $u_j$ are discussed in Exercises 9c and 10.
The proof of Theorem 11.3 will be carried out in two steps. First, we shall
state and prove a lemma and then use this to prove the theorem. In the
statement of the lemma we shall let $x^0 = (x_1^0, \ldots, x_n^0)$ in $X$ be fixed and, for
any $x \in X$ and $I \subseteq \{1, 2, \ldots, n\}$, let $x[I]$ be the $n$-tuple in $X$ whose $i$th
component is $x_i$ if $i \in I$ and $x_i^0$ if $i \notin I$.


LEMMA 11.5. Suppose $\mathcal{P}$ contains every simple probability measure on $X$,
and a real-valued function $u$ on $X$ satisfies $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$ for all
simple measures. Suppose further that $[P, Q \in \mathcal{P}, (P_1, \ldots, P_m) = (Q_1, \ldots,
Q_m)] \Rightarrow P \sim Q$. Then, for all $x \in X$,

$$u(x) = \sum_{k=1}^m (-1)^{k+1} \sum_{1 \le i_1 < \cdots < i_k \le m} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l}\Big]\Big). \qquad (11.8)$$

For $m = 3$, (11.8) is $u(x) = u(x[I_1]) + u(x[I_2]) + u(x[I_3]) - \{u(x[I_1 \cap
I_2]) + u(x[I_1 \cap I_3]) + u(x[I_2 \cap I_3])\} + u(x[I_1 \cap I_2 \cap I_3])$.
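The inclusion-exclusion identity of the lemma can be tested numerically on a small product set. This is a sketch under our own assumptions: indices are 0-based, the functions `g` and `h` and the reference point are hypothetical, and the check runs over $\{0, 1, 2\}^3$ with $I_1 = \{1, 2\}$ and $I_2 = \{2, 3\}$ in the text's numbering.

```python
from itertools import combinations, product


def x_bracket(x, x0, I):
    """x[I]: keep x_i for i in I and use the reference value x0_i otherwise."""
    return tuple(x[i] if i in I else x0[i] for i in range(len(x)))


def inclusion_exclusion(u, x, x0, index_sets):
    """Alternating sum over intersections of every nonempty subfamily of index sets."""
    total = 0
    for k in range(1, len(index_sets) + 1):
        for combo in combinations(index_sets, k):
            I = set.intersection(*combo)
            total += (-1) ** (k + 1) * u(x_bracket(x, x0, I))
    return total


index_sets = [{0, 1}, {1, 2}]  # 0-based versions of {1, 2} and {2, 3}
x0 = (0, 1, 2)
points = list(product(range(3), repeat=3))


def u_dec(x):
    # Decomposes as g(x_1, x_2) + h(x_2, x_3); both pieces are hypothetical.
    g = x[0] * x[1] + 2 * x[0]
    h = x[1] - 3 * x[2] * x[1]
    return g + h


def u_prod(x):
    # A three-way interaction that does not decompose over the two index sets.
    return x[0] * x[1] * x[2]


holds_dec = all(u_dec(x) == inclusion_exclusion(u_dec, x, x0, index_sets) for x in points)
holds_prod = all(u_prod(x) == inclusion_exclusion(u_prod, x, x0, index_sets) for x in points)
print(holds_dec, holds_prod)
```

With two index sets the identity reads $u(x) = u(x[I_1]) + u(x[I_2]) - u(x[I_1 \cap I_2])$, which holds for the decomposable function and fails for the product.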


Proof of Lemma 11.5. To simplify notation let $x$ be an arbitrary element
in $X$ and let $[I]^j$ be the projection of $x[I]$ onto $I_j$. (If $I = \{1, 3\}$ then $x[I] =
(x_1, x_2^0, x_3, x_4^0, \ldots)$. Then if $I_j = \{1, 4\}$, $[I]^j = (x_1, x_4^0)$.) Because the only
integers in $I$ that are relevant in defining $[I]^j$ are those in $I_j$, $[I]^j = [I \cap I_j]^j$.
Let $S$ and $R$ on $\{1, \ldots, m\} \times \{1, \ldots, m\}$ be defined by

$$S(k, j) = \{(i_1, \ldots, i_k) : 1 \le i_1 < \cdots < i_k \le m,\ j \in \{i_1, \ldots, i_k\}\}$$

$$R(k, j) = \{(i_1, \ldots, i_k) : 1 \le i_1 < \cdots < i_k \le m,\ j \notin \{i_1, \ldots, i_k\}\}$$

so that $S(1, j) = \{j\}$, $R(m, j) = \emptyset$, and

$$S(k, j) \cup R(k, j) = \{(i_1, \ldots, i_k) : 1 \le i_1 < \cdots < i_k \le m\}, \quad j = 1, \ldots, m.$$

$S(k, j) \cup R(k, j)$ has $\binom{m}{k}$ elements.

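Let $P$ and $Q$ be simple probability measures defined by

$$P = ax + \sum_{k \in K_e} \sum_{S(k,j) \cup R(k,j)} a\, x\Big[\bigcap_{l=1}^k I_{i_l}\Big]$$

$$Q = \sum_{k \in K_o} \sum_{S(k,j) \cup R(k,j)} a\, x\Big[\bigcap_{l=1}^k I_{i_l}\Big]$$

where $a = 2^{1-m}$, $K_e = \{k : 2 \le k \le m,\ k \text{ is even}\}$, and
$K_o = \{k : 1 \le k \le m,\ k \text{ is odd}\}$. For the marginals on $\prod_{I_j} X_i$ our
definitions give

$$P_j = ax^j + a \sum_{K_e} \Big( \sum_{S(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j + \sum_{R(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j \Big)$$

$$= ax^j + a \sum_{K_e} \Big( \sum_{R(k-1,j)} \Big[\bigcap_{l=1}^{k-1} I_{i_l}\Big]^j + \sum_{R(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j \Big)$$

$$= ax^j + a \sum_{k=1}^m \sum_{R(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j$$

$$= a[I_j]^j + a \sum_{R(1,j)} [I_{i_1}]^j + a \sum_{K_o \setminus \{1\}} \Big( \sum_{R(k-1,j)} \Big[\bigcap_{l=1}^{k-1} I_{i_l}\Big]^j + \sum_{R(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j \Big)$$

$$= a \sum_{S(1,j)} [I_{i_1}]^j + a \sum_{R(1,j)} [I_{i_1}]^j + a \sum_{K_o \setminus \{1\}} \Big( \sum_{S(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j + \sum_{R(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j \Big)$$

$$= a \sum_{K_o} \sum_{S(k,j) \cup R(k,j)} \Big[\bigcap_{l=1}^k I_{i_l}\Big]^j$$

$$= Q_j.$$

Hence, by hypothesis, $P \sim Q$ and therefore $E(u, P) = E(u, Q)$, or

$$u(x) + \sum_{K_e} \sum_{S(k,j) \cup R(k,j)} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l}\Big]\Big) = \sum_{K_o} \sum_{S(k,j) \cup R(k,j)} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l}\Big]\Big)$$

which is (11.8). ♦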

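Proof of Theorem 11.3 (Sufficiency). To verify (11.7) under the stated
hypotheses, including $(P_1, \ldots, P_m) = (Q_1, \ldots, Q_m) \Rightarrow P \sim Q$, we note
first from Lemma 11.5 that (11.8) holds for $u$ on $X$. With $x^0 \in X$ fixed as in
the lemma we define $u_j$ on $\prod_{I_j} X_i$ as follows:

$$u_j(x^j) = u(x[I_j]) + \sum_{k=1}^{j-1} (-1)^k \sum_{1 \le i_1 < \cdots < i_k < j} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l} \cap I_j\Big]\Big).$$

$u_j$ is well defined since $u_j(x^j) = u_j(y^j)$ if $x^j = y^j$. Moreover, if $u$ on $X$ is
bounded (as in Chapter 10) then $u_j$ is bounded. Summing over $j$:

$$\sum_{j=1}^m u_j(x^j) = \sum_{j=1}^m u(x[I_j]) + \sum_{j=2}^m \sum_{k=1}^{j-1} (-1)^k \sum_{1 \le i_1 < \cdots < i_k < j} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l} \cap I_j\Big]\Big)$$

$$= \sum_{j=1}^m u(x[I_j]) + \sum_{k=1}^{m-1} (-1)^k \sum_{j=k+1}^m \sum_{1 \le i_1 < \cdots < i_k < j} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l} \cap I_j\Big]\Big)$$

$$= \sum_{j=1}^m u(x[I_j]) + \sum_{k=1}^{m-1} (-1)^k \sum_{1 \le i_1 < \cdots < i_{k+1} \le m} u\Big(x\Big[\bigcap_{l=1}^{k+1} I_{i_l}\Big]\Big)$$

$$= \sum_{j=1}^m u(x[I_j]) + \sum_{k=2}^m (-1)^{k+1} \sum_{1 \le i_1 < \cdots < i_k \le m} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l}\Big]\Big)$$

$$= \sum_{k=1}^m (-1)^{k+1} \sum_{1 \le i_1 < \cdots < i_k \le m} u\Big(x\Big[\bigcap_{l=1}^k I_{i_l}\Big]\Big)$$

$$= u(x) \quad \text{by (11.8)},$$

from which (11.7) readily follows. ♦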
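The component functions used in the sufficiency proof can be built and summed for a small example. The sketch below rests on assumptions of ours: indices are 0-based, the decomposable utility and the reference point are hypothetical, and each $u_j$ is formed from $u(x[I_j])$ with alternating corrections over intersections of $I_j$ with earlier index sets.

```python
from itertools import combinations, product


def x_bracket(x, x0, I):
    """x[I]: keep x_i for i in I and use the reference value x0_i otherwise."""
    return tuple(x[i] if i in I else x0[i] for i in range(len(x)))


def u_component(u, x, x0, index_sets, j):
    """u_j at x: u(x[I_j]) corrected by intersections with the sets before I_j."""
    total = u(x_bracket(x, x0, index_sets[j]))
    for k in range(1, j + 1):
        for combo in combinations(index_sets[:j], k):
            I = set.intersection(index_sets[j], *combo)
            total += (-1) ** k * u(x_bracket(x, x0, I))
    return total


index_sets = [{0, 1}, {1, 2}, {2, 3}]  # a chain of overlapping pairs
x0 = (1, 0, 2, 1)
points = list(product(range(3), repeat=4))


def u(x):
    # Hypothetical utility that decomposes over the three index sets.
    return (x[0] * x[1] + x[0]) + (x[1] - 2 * x[2] + x[1] * x[2]) + (x[2] * x[2] * x[3] + 1)


sums_to_u = all(
    u(x) == sum(u_component(u, x, x0, index_sets, j) for j in range(3))
    for x in points
)

# u_j should depend only on the coordinates in I_j: vary a coordinate outside it.
depends_only_on_Ij = all(
    u_component(u, x, x0, index_sets, 1)
    == u_component(u, (2 - x[0],) + x[1:], x0, index_sets, 1)
    for x in points
)
print(sums_to_u, depends_only_on_Ij)
```

When the index sets are the singletons of Section 11.1, the same construction reduces to varying one coordinate at a time away from $x^0$.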


11.4  PROBABILITY  MEASURES  ON  HOMOGENEOUS  PRODUCT  SETS 

Throughout this section $X = A^n$, $\mathcal{P}_s$ is the set of simple probability
measures on $X$, and $\mathcal{R}$ is the set of simple probability measures on $A$. For
$P \in \mathcal{P}_s$, $P_i \in \mathcal{R}$ is the marginal measure of $P$ on the $i$th $A$: that is, $P_i(B) =
P(A^{i-1} \times B \times A^{n-i})$ for $B \subseteq A$. The marginal measure of $P$ on all but the
$i$th $A$ will be denoted $P_i^c$: $P_i^c((x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_n)) = P(\{(x_1, \ldots,
x_{i-1}, a, x_{i+1}, \ldots, x_n) : a \in A\})$.

Based on $\prec$ on $\mathcal{P}_s$, we define $\prec$ on $\mathcal{R}$ as follows:

$R \prec R^* \Leftrightarrow P \prec Q$ for every $P, Q \in \mathcal{P}_s$ such that $P_i = R$ and $Q_i = R^*$ for
all $i$. Three special preference conditions will be applied to this case:

C1. $[P, Q \in \mathcal{P}_s,\ P_i = Q_i \text{ for } i = 1, \ldots, n] \Rightarrow P \sim Q$.

C2. $[P, Q \in \mathcal{P}_s,\ P_i = R,\ Q_i = R^*,\ P_i^c = Q_i^c] \Rightarrow [P \prec Q \Leftrightarrow R \prec R^*]$.

C3. For some $R \in \mathcal{R}$, $[P, Q, P^*, Q^* \in \mathcal{P}_s,\ P_n = Q_n = P_1^* = Q_1^* = R,\
P_n^c = P_1^{*c},\ Q_n^c = Q_1^{*c}] \Rightarrow [P \prec Q \Leftrightarrow P^* \prec Q^*]$.

C2 is a persistence condition, much like the definition of persistence in
Section 7.1. Under C1, all $P$ that have $P_i = R$ for $i = 1, \ldots, n$ are
indifferent, and all $Q$ that have $Q_i = R^*$ for all $i$ are indifferent. Hence if $\prec$ on
$\mathcal{P}_s$ is a weak order then, if $P \prec Q$ for one such $P$ and $Q$, $P \prec Q$ for all such $P$
and $Q$ so that $\prec$ on $\mathcal{R}$ is a "faithful" weak order. C2 says that this weak order
on $\mathcal{R}$ applies to each of the $n$ factors. C3 is a form of stationary condition,
and compares with stationary as defined in Definition 7.3 of Section 7.3.



The  reasonableness  of  these  conditions  is,  of  course,  doubtful  in  most 
situations. 

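THEOREM 11.4. Suppose that there is a real-valued function $u$ on $X = A^n$
that satisfies $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$, for all $P, Q \in \mathcal{P}_s$, that $P \prec Q$ for
some $P, Q \in \mathcal{P}_s$, and that C1 and C2 hold. Then there is a real-valued function
$\rho$ on $A$ and positive numbers $\lambda_1, \ldots, \lambda_n$ such that

$$P \prec Q \Leftrightarrow \sum_{i=1}^n \lambda_i E(\rho, P_i) < \sum_{i=1}^n \lambda_i E(\rho, Q_i), \quad \text{for all } P, Q \in \mathcal{P}_s, \qquad (11.9)$$

and $\rho'$ on $A$ and positive $\lambda_1', \ldots, \lambda_n'$ satisfy (11.9) along with $\rho$ and $\lambda_1, \ldots, \lambda_n$
if and only if there are real numbers $p > 0$, $q > 0$ and $r$ such that

$$\lambda_i' = p\lambda_i \quad \text{for } i = 1, \ldots, n \qquad (11.10)$$

$$\rho'(a) = q\rho(a) + r \quad \text{for all } a \in A. \qquad (11.11)$$

If, in addition, C3 holds and $n \ge 2$ then there is a unique number $\pi > 0$ such
that

$$P \prec Q \Leftrightarrow \sum_{i=1}^n \pi^{i-1} E(\rho, P_i) < \sum_{i=1}^n \pi^{i-1} E(\rho, Q_i), \quad \text{for all } P, Q \in \mathcal{P}_s. \qquad (11.12)$$

Expression (11.9) compares with (7.9) and (11.12) compares with (7.13).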
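To see how the weighted and discounted forms are evaluated, the following sketch computes $\sum_i \pi^{i-1} E(\rho, P_i)$ for simple measures on $A^2$. Everything numerical here is an assumption for illustration: the utility $\rho$ on two income levels, the discount factor $\pi = 9/10$, and the helper names.

```python
from collections import defaultdict
from fractions import Fraction


def marginal(P, i):
    """Marginal of a simple measure {outcome tuple: probability} on the i-th factor."""
    out = defaultdict(Fraction)
    for x, p in P.items():
        out[x[i]] += p
    return dict(out)


def expect(rho, R):
    """Expected value of rho under a simple measure R on A."""
    return sum(rho[a] * p for a, p in R.items())


def value(P, rho, weights):
    """Weighted sum of marginal expectations, one weight per factor."""
    return sum(w * expect(rho, marginal(P, i)) for i, w in enumerate(weights))


def discounted_weights(pi, n):
    """Weights pi^(i-1) for periods i = 1..n."""
    return [pi ** i for i in range(n)]


rho = {5000: Fraction(0), 100000: Fraction(1)}  # hypothetical utility on A
weights = discounted_weights(Fraction(9, 10), 2)
half = Fraction(1, 2)

P = {(5000, 5000): half, (100000, 100000): half}
Q = {(5000, 100000): half, (100000, 5000): half}
early = {(100000, 5000): Fraction(1)}  # high income in the first year
late = {(5000, 100000): Fraction(1)}   # high income in the second year

print(value(P, rho, weights), value(Q, rho, weights))
print(value(early, rho, weights), value(late, rho, weights))
```

Because the value depends on a measure only through its marginals, the two even-chance gambles receive the same number, in line with C1. With a discount factor below one, the stream that pays the high income first gets the larger value.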

Proof. To obtain (11.9) we use Theorem 11.1 to obtain (11.1) for all
$P, Q \in \mathcal{P}_s$, where each $u_i$ is defined on $A$. C1 is used in this. It then follows
from C2 and the definition of $\prec$ on $\mathcal{R}$ that, for each $i$, $R \prec R^* \Leftrightarrow E(u_i, R) <
E(u_i, R^*)$ for all $R, R^* \in \mathcal{R}$. It follows from Theorem 8.4 that the $u_i$ are
related by positive linear transformations, say $u_j = a_j u_1 + b_j$ with $a_j > 0$ for
$j = 2, \ldots, n$. Let $\rho = u_1$ and $\lambda_1 = 1$, $\lambda_j = a_j$ for $j = 2, \ldots, n$. Then (11.9)
follows.

Suppose $\rho'$ and $\lambda_i' > 0$ satisfy (11.9) also. Then, since the $\lambda_i \rho$ are unique
up to similar positive linear transformations by Theorem 11.1, there are
numbers $k > 0$ and $\beta_1, \ldots, \beta_n$ such that $\lambda_i' \rho' = k\lambda_i \rho + \beta_i$ for $i = 1, \ldots, n$.
(11.10) and (11.11) then follow as in the proof of Theorem 7.4. $P \prec Q$
for some $P, Q \in \mathcal{P}_s$ is used in obtaining (11.10).

The proof for (11.12) follows the general lines given in the proof of
Theorem 7.5 and will not be detailed here. ♦

11.5  SUMMARY 

When $X = \prod_{i=1}^n X_i$ the usual expected utility axioms along with a
condition that says that $P \sim Q$ when the marginal measure of $P$ for $X_i$
equals the marginal measure of $Q$ for $X_i$ ($i = 1, \ldots, n$) leads to the additive
form $P \prec Q \Leftrightarrow \sum_i E(u_i, P_i) < \sum_i E(u_i, Q_i)$. This was proved in general for
$X = \prod_{i=1}^n X_i$ and for $X \subseteq X_1 \times X_2$. (It is true also for simple measures
when $X \subseteq \prod_{i=1}^n X_i$, but the proof of this was discovered too late for inclusion
here.)

Under the additive, expected utility representation in the homogeneous
context with $X = A^n$, a persistence condition leads to $P \prec Q \Leftrightarrow \sum_i \lambda_i E(\rho,
P_i) < \sum_i \lambda_i E(\rho, Q_i)$, and persistence and stationarity lead to $P \prec Q \Leftrightarrow
\sum_i \pi^{i-1} E(\rho, P_i) < \sum_i \pi^{i-1} E(\rho, Q_i)$.


INDEX  TO  EXERCISES 

1. 50-50 indifference condition. 2. Binary relations on the $\mathcal{P}_i$. 3. Marginal expectations.
4. Additivity with finite $X \subseteq \prod X_i$. 5-8. Alternating sequences and cycles. 9. Markovian
dependence in utility theory. 10. Admissible transformations. 11. Theorem 11.3 versus
Theorem 11.2. 12. No time preference. 13-14. Theorem 11.4.


Exercises 

1. For the bridge-construction example of Section 10.1 let $x_1$ be cost and let $x_2$
be completion time in $(x_1, x_2) \in X_1 \times X_2$. Assume that both factors are subject to
uncertainty. With $X = X_1 \times X_2$ argue that only 50-50 gambles of the following
form need to be used in testing the indifference condition of Theorem 11.1: $P$ gives
(\$100 million, 4 years) or ($x_1$ million, $x_2$ years) each with probability .5; $Q$ gives
(\$100 million, $x_2$ years) or ($x_1$ million, 4 years) each with probability .5.

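2. Let $\mathcal{P}$ be the set of simple probability measures on $X = \prod_{i=1}^n X_i$ and let $\mathcal{P}_i$
be the set of simple probability measures on $X_i$. With $a, b \in \mathcal{P}_i$ define $a \preceq_i b \Leftrightarrow P \preceq
Q$ for every $P, Q \in \mathcal{P}$ such that $P_i = a$, $Q_i = b$, and $P_i^c = Q_i^c$, where $P_i^c$ ($Q_i^c$) is the
marginal of $P$ ($Q$) on $\prod_{j \neq i} X_j$. Also let $a \prec_i b \Leftrightarrow (a \preceq_i b, \text{ not } b \preceq_i a)$, $a \sim_i b \Leftrightarrow
(a \preceq_i b,\ b \preceq_i a)$. We identify the following conditions:

A. $\preceq$ on $\mathcal{P}$ is transitive and connected;

B. $(P \prec Q,\ 0 < \alpha < 1) \Rightarrow \alpha P + (1 - \alpha)R \prec \alpha Q + (1 - \alpha)R$;

C. $(P \sim Q,\ 0 < \alpha < 1) \Rightarrow \alpha P + (1 - \alpha)R \sim \alpha Q + (1 - \alpha)R$;

D. $(P_i = Q_i \text{ for } i = 1, \ldots, n) \Rightarrow P \sim Q$,

where $P \prec Q \Leftrightarrow (P \preceq Q, \text{ not } Q \preceq P)$ and $P \sim Q \Leftrightarrow (P \preceq Q,\ Q \preceq P)$. Prove the
following theorems. The $\nRightarrow$ means "does not imply."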

a.  (^  on  IT  is  transitive)  =>  (<*  on  3,  is  transitive). 

b.  (Pi  Qi,  P*  =  QJ)  =>  P  ~  G. 

c.  (Pi^Qi'Pt-QV&P  <Q 

d.  (<  is  transitive,  P,  ■<*(?<  for  all  i)  =>  P  <  Q. 

e.  (A,  each  on  ^  is  transitive  and  connected,  P,  <t-  Qt  for  all  i,  P<  <  Qt  for 
some  i)  P  <  Q. 

f  D  <  on  J  and  on  are  reflexive. 


Exercises 


159 


g.  (it  —  2,  <(  is  reflexive  for  /  **  1,  2)  =>  D. 

h.  (n  >  2,  <<  is  reflexive  for  each  /)  D. 

i.  ( n  ^  2,  is  reflexive  for  each  i,  ^  is  transitive)  =>  D. 

./•  04,  £>)  #>  on  is  transitive  and  connected. 

k.  K  is  transitive,  B,  C,  D,  Pi  -  a,  &  =  6,  p«  =  Q«,  o  ■<, b)  =>  P  <  Q. 

L  (<  is  transitive,  Bt  C,  D,  Pi<tQt  for  all  i,  Pt  <t  Q{  for  some  i)=>P<  Q. 

m.  When  C  in  k  and  /  is  replaced  by  A,  the  conclusions  of  the  two  theorems  can 
be  false. 

n.  (A,  B,  C,  D)  =>  on  !T,  is  transitive  and  connected. 

o.  (A,  B,  C,  D ,  Pi  <i  Qi}  0  <  a  <  1)  =>  ctP,  +  (1  -  a)Rt  <<  +  (1  -  «)jf 

p.  (A,  B,  C,  D,  Pi  <i  Qit  0  <  a  <  1)  =>  rtPi  -f  (1  -  a)/?,,  aQ<  +  (1  _  a)/?, 

3. With $X \subseteq \prod_{i=1}^n X_i$ let $P_i$ be the marginal measure on $X_i$ of the probability
measure $P$ on $X$ and let $f$ on $X$ and $f_i$ on $X_i$ be real-valued functions that satisfy
$f(x_1, \ldots, x_n) = f_i(x_i)$ for all $x \in X$. Prove:

a. ($P$ is simple, $X = \prod X_i$) $\Rightarrow E(f, P) = E(f_i, P_i)$.

b. ($P$ is simple, $X \subseteq \prod X_i$) $\Rightarrow E(f, P) = E(f_i, P_i)$.

c. ($f_i$ is bounded, $X \subseteq \prod X_i$) $\Rightarrow E(f, P) = E(f_i, P_i)$.

4. Suppose that $\mathcal{P}$ is the set of simple probability measures on a finite set $X \subseteq
\prod_{i=1}^n X_i$, that there is a real-valued function $u$ on $X$ that satisfies $P \prec Q \Leftrightarrow E(u, P) <
E(u, Q)$ for every $P, Q \in \mathcal{P}$, and that $(P, Q \in \mathcal{P},\ P_i = Q_i \text{ for } i = 1, \ldots, n) \Rightarrow P \sim Q$.
Then there are real-valued functions $u_1, \ldots, u_n$ on $X_1, \ldots, X_n$ respectively that
satisfy (11.1).

Prove this theorem using the following steps.

a. To establish $u(x_1, \ldots, x_n) = \sum_i u_i(x_i)$ for each $x \in X$ note that this system of
equations is the same as

$$\sum_{k=1}^N a_{jk} y_k = u(x^j), \quad j = 1, \ldots, M \qquad (11.13)$$

when we let $X = \{x^1, \ldots, x^M\}$, $X_i = \{x_{i1}, \ldots, x_{im_i}\}$, $(y_1, \ldots, y_N) =
(u_1(x_{11}), \ldots, u_1(x_{1m_1}), u_2(x_{21}), \ldots, u_n(x_{n1}), \ldots, u_n(x_{nm_n}))$, $N = \sum_{i=1}^n m_i$, and define the $a_{jk} \in
\{0, 1\}$ in an appropriate manner with $\sum_{k=1}^N a_{jk} = n$ for each $j$.

It is a well-known fact of linear algebra that (11.13) has a $y$-solution if and only
if for any non-zero vector $(c_1, \ldots, c_M) \in Re^M$

$$\Big( \sum_{j=1}^M c_j a_{jk} = 0 \text{ for } k = 1, \ldots, N \Big) \Rightarrow \sum_{j=1}^M c_j u(x^j) = 0. \qquad (11.14)$$

To verify this for a non-zero $(c_1, \ldots, c_M)$ let $A = \{j : c_j > 0\}$, $B = \{j : c_j < 0\}$,
$P = \sum_A (c_j / \sum_A c_j) x^j$ and $Q = \sum_B (c_j / \sum_B c_j) x^j$, and show that the left side of
(11.14) implies that $A \neq \emptyset$ and $B \neq \emptyset$, that $P_i = Q_i$ for $i = 1, \ldots, n$, and
that $\sum_A c_j = -\sum_B c_j$. Then use the indifference condition to establish (11.14).

5. In the $X \subseteq X_1 \times X_2$ context of Section 11.2 verify:

a.  R  in  Definition  11.2  is  an  equivalence. 

b.  Lemma  11.1.  (Consider  a  shortest  sequence.) 

c.  A  cycle  has  at  least  four  elements. 


6. Let $X = \{(x_1, x_2), (x_1, z_2), (y_1, z_2), (y_1, y_2), (z_1, y_2), (z_1, x_2)\}$, and let $u$ on $X$
satisfy $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$ for all probability measures $P, Q$ on $X$. Prove:

a. $X$ is a cycle.

b. $X$ has no four-element cycle.

c. The 50-50 indifference condition of Theorem 11.1 holds.

d. (11.1) can be false.

7. Prove Lemma 11.2 by showing that if $D$ has more than one alternating sequence
from $(x_1, x_2)$ to $(y_1, y_2)$ then $D$ includes a cycle.

8.  Prove  Lemma  11.3. 

9. In the context of Section 11.3 let $I_i = \{i, i + 1\}$ for $i = 1, 2, \ldots, n - 1$ with
$m = n - 1$.

a. Verify that the indifference condition at the end of Theorem 11.3 implies the
following: $[\{(x_i, x_{i+1}), (y_i, y_{i+1})\} = \{(z_i, z_{i+1}), (w_i, w_{i+1})\}$ for $i = 1, \ldots, n -
1] \Rightarrow \tfrac{1}{2}x + \tfrac{1}{2}y \sim \tfrac{1}{2}z + \tfrac{1}{2}w$. (The latter are 50-50 gambles.)

b. Prove that Theorem 11.3 is true for the case at hand when the indifference
condition in Theorem 11.3 is replaced by the 50-50 indifference condition in
(a). Obtain $u(x) = \sum_{i=1}^{n-1} u_i(x_i, x_{i+1})$.

c. With $P \prec Q \Leftrightarrow E(u, P) < E(u, Q)$ for all $P, Q \in \mathcal{P}$, suppose that $u(x) =
\sum_{i=1}^{n-1} u_i(x_i, x_{i+1}) = \sum_{i=1}^{n-1} v_i(x_i, x_{i+1})$ for all $x \in X$. Show that there are
real-valued functions $f_2, \ldots, f_{n-1}$ on $X_2, \ldots, X_{n-1}$ such that

$$v_1(a, b) = u_1(a, b) + f_2(b) \quad \text{for all } (a, b) \in X_1 \times X_2,$$

$$v_i(a, b) = u_i(a, b) - f_i(a) + f_{i+1}(b) \quad \text{for all } (a, b) \in X_i \times X_{i+1},\ 2 \le i \le n - 2,$$

$$v_{n-1}(a, b) = u_{n-1}(a, b) - f_{n-1}(a) \quad \text{for all } (a, b) \in X_{n-1} \times X_n.$$

10. In the context of Section 11.3 suppose that $u(x) = \sum_{j=1}^m u_j(x^j)$ for all $x \in
\prod_{i=1}^n X_i$ as in the proof of Theorem 11.3. With $u$ fixed, describe the set of
transformations on the $u_j$ that preserve equality. Note that, if $\{v_j\}$ is such a transformation
of $\{u_j\}$ then $\sum_j u_j(x^j) = \sum_j v_j(x^j)$ and consequently $\sum_k v_k([I_j]^k) = \sum_k u_k([I_j]^k)$ for
$j = 1, \ldots, m$ so that

$$v_j(x^j) = u_j(x^j) + \sum_{k \neq j} [u_k([I_j]^k) - v_k([I_j]^k)].$$

If $I_j \cap I_k = \emptyset$, argue that $u_k([I_j]^k) - v_k([I_j]^k)$ is constant as $x$ ranges over $X$,
and if $I_j \cap I_k \neq \emptyset$ then the stated difference varies as $x$ ranges over $X$ but the
variation is caused only by the $x_i$ for $i \in I_j \cap I_k$.

11. Argue that if the generalization of Theorem 11.2 were true for $X \subseteq \prod_{i=1}^n X_i$
with $n > 2$, then Theorem 11.3 for simple measures would be an immediate corollary
of the more general form of Theorem 11.2.

12. Show that the hypotheses in the first two lines of Theorem 11.4 along with
($P_1, \ldots, P_n$ is a permutation of $Q_1, \ldots, Q_n$) $\Rightarrow P \sim Q$, imply that there is a
real-valued function $\rho$ on $A$ such that, for all $P, Q \in \mathcal{P}_s$, $P \prec Q \Leftrightarrow \sum_i E(\rho, P_i) <
\sum_i E(\rho, Q_i)$.

13.  Verify  (11.12)  in  Theorem  11.4. 

14. Can you imagine a situation in the context of Section 11.4 where any one of
C1, C2, and C3 seems reasonable with $n > 1$?


PART III

STATES  OF  THE  WORLD 


Preference  structures  that  incorporate  uncertainty  in  the  formulation  of 
alternatives  but  do  not  presuppose  probability  have  been  expressed  mostly 
in  states  of  the  world  models.  In  such  a  model  the  uncertainty  concerns  which 
state  in  a  set  of  mutually  exclusive  states  (or  environments)  obtains,  or  is  the 
“true  state.”  It  is  generally  assumed  that  (1)  the  decision  maker  does  not 
know  the  “true  state,”  (2)  the  act  he  selects  has  no  effect  on  the  state  that 
obtains,  and  (3)  the  state  that  obtains  affects  the  outcome  of  the  decision  in 
conjunction  with  the  act  selected. 

Interest  in  expected-utility  theories  that  are  set  in  the  states  of  the  world 
formulation  is  due  in  large  part  to  Leonard  J.  Savage’s  theory  (Chapter  14), 
published  in  1954.  Before  this,  the  now  widely-referenced  theory  of  Frank 
P.  Ramsey  (1931)  was  virtually  unknown.  Savage’s  theory  reflects  elements 
from  Ramsey  and  from  John  von  Neumann  and  Oskar  Morgenstern:  his 
interpretation  of  probability  owes  much  to  the  pioneering  work  of  Bruno  de 
Finetti. 

Some  other  theory  for  the  states  formulation  is  presented  in  Chapters  12 
and  13. 


Chapter  12 

STATES  OF  THE  WORLD 


This  chapter  introduces  the  states  of  the  world  formulation  for  decision  under 
uncertainty.  The  first  section  describes  the  usual  states  formulation  and 
compares  it  with  the  approach  of  Part  II.  The  second  section  examines  the 
weak-order  expected-utility  model  for  the  states  formulation,  and  discusses 
several  axiomatic  approaches  to  the  model.  Several  of  these  approaches  are 
explored  in  the  next  two  chapters. 

The  second  section  also  points  out  two  problems  that  arise  in  the  theories. 
One of these, often referred to as the “constant acts” problem, suggests an
alternative  approach  to  the  expected-utility  model.  Axioms  for  the  alternative 
approach  have  yet  to  be  discovered.  The  second  problem  concerns  the 
fineness  of  state  descriptions  and  residual  uncertainty.  Some  additive  utility 
models  that  are  designed  for  this  possibility  and  which  do  not  explicitly 
include  state  probabilities  are  discussed  in  the  third  section. 

12.1  STATES  AND  STATES 

In  Part  II  of  this  book  we  thought  of  a  decision  under  uncertainty  in  terms 
of  a  set  F  of  available  acts  or  strategies  and  a  set  X  of  consequences,  one  of 
which  will  follow  from  the  selected  act.  We  assumed  that  the  decision  maker’s 
uncertainty about which $x \in X$ would occur if $f \in F$ were selected could be
expressed by a probability measure $P_f$ on $X$. The axioms were based on sets
of probability measures that supposedly included $\{P_f : f \in F\}$.

To enlarge on this let $S'$ be the set of functions on acts to consequences.
Each $s \in S'$ assigns a consequence $s(f) \in X$ to each $f \in F$. Suppose, for
example, that a young man will propose marriage to either Alice or Betsy,
but not both in case one refuses him. Suppose further that he is interested
only in the three consequences in {Marry Alice, Marry Betsy, Stay Single}.
In this case $S'$ contains nine functions, but only four of these need be
considered. The four are {(Propose to Alice, Marry Alice), (Propose to Betsy,
Marry Betsy)}, {(Propose to Alice, Marry Alice), (Propose to Betsy, Stay
Single)}, {(Propose to Alice, Stay Single), (Propose to Betsy, Marry Betsy)},
and {(Propose to Alice, Stay Single), (Propose to Betsy, Stay Single)}. One
of the five functions that is excluded is {(Propose to Alice, Marry Betsy),
(Propose to Betsy, Marry Alice)}.
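
The count of nine functions, four retained and five excluded, can be checked by direct enumeration. The following is a minimal sketch, not part of the original text; the names `all_functions` and `is_feasible` are illustrative, and the feasibility rule (a proposal can lead only to marrying the woman proposed to or to staying single) is an assumption read off the example.

```python
from itertools import product

acts = ["Propose to Alice", "Propose to Betsy"]
consequences = ["Marry Alice", "Marry Betsy", "Stay Single"]

# Every function from acts to consequences, stored as a dict act -> consequence.
all_functions = [
    dict(zip(acts, outcome))
    for outcome in product(consequences, repeat=len(acts))
]

def is_feasible(s):
    """Assumed rule: a proposal ends in marrying that woman or staying single."""
    return (
        s["Propose to Alice"] in ("Marry Alice", "Stay Single")
        and s["Propose to Betsy"] in ("Marry Betsy", "Stay Single")
    )

feasible = [s for s in all_functions if is_feasible(s)]
excluded = [s for s in all_functions if not is_feasible(s)]

print(len(all_functions), len(feasible), len(excluded))  # 9 4 5
```

Each retained function corresponds to one pattern of answers (both would say “yes,” only Alice, only Betsy, neither).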

Suppose that if act $f$ is implemented then consequence $s(f)$ will occur and
this is true for each $f \in F$. Then we say that $s$ obtains. By $C' \subseteq S'$ obtains we
mean that some $s \in C'$ obtains. Suppose the decision maker has a probability
measure $P'$ on (the set of subsets of) $S'$. $P'(C')$ is interpreted as a measure of
his belief in the truth of the proposition “$C'$ obtains.” Given $P'$ we could
define $P_f$ by

$$P_f(A) = P'(\{s : s \in S',\ s(f) \in A\}) \quad \text{for all } A \subseteq X. \tag{12.1}$$

In the marriage example we would expect that $P'(C') = 0$ when $C'$ is the set
of the five “excluded” functions. Then we would have $P'$(either girl would
say “yes”) + $P'$(only Alice would say “yes”) + $P'$(only Betsy would say
“yes”) + $P'$(neither girl would say “yes”) = 1. Here we have translated the
four  functions  into  conditions  under  which  they  will  obtain.  For  example 
{(Propose  to  Alice,  Marry  Alice),  (Propose  to  Betsy,  Stay  Single)}  obtains 
if  and  only  if  only  Alice  would  say  “yes.” 

If $s \in S'$ obtains it will obtain regardless of which $f$ is implemented. This
is a result of the way $S'$ has been formulated. Hence the decision maker’s
choice  should  not  influence  his  beliefs  about  which  s  might  obtain.  But  we 
expect  that  his  beliefs  about  which  s  might  obtain  will  influence  his  choice. 

In most cases $P'$ on $S'$ contains more information about the decision
maker’s uncertainty than does $\{P_f : f \in F\}$ when the $P_f$ are probability measures
defined from $P'$ as in (12.1). To determine an act in $F$ that maximizes expected
utility it is usually unnecessary to estimate all of $P'$, a task that may be an
order of magnitude more difficult than the estimation of the $P_f$.

Although four potentially nonzero $P'(s)$ were noted in the marriage
example, our young man would presumably be satisfied with estimating the
two probabilities $p = P_{\text{Propose to Alice}}$(Marry Alice) $= P_{\text{Propose to Alice}}$(Alice
would say “yes”) and $q = P_{\text{Propose to Betsy}}$(Marry Betsy) $= P_{\text{Propose to Betsy}}$(Betsy
would say “yes”). In fact, all he needs is an estimate
of the ratio $p/q$ since $E(u, \text{Propose to Alice}) < E(u, \text{Propose to Betsy}) \iff
p/q < [u(\text{Marry Betsy}) - u(\text{Stay Single})]/[u(\text{Marry Alice}) - u(\text{Stay Single})]$.
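
The ratio test can be compared numerically with a direct comparison of expected utilities. The sketch below is illustrative only: the utilities 4, 3, and 0 and the probability pairs are assumed values, and the ratio form presumes $q > 0$ and $u(\text{Marry Alice}) > u(\text{Stay Single})$, which the text leaves implicit.

```python
def expected_utility(p_yes, u_marry, u_single):
    """Expected utility of proposing when acceptance has probability p_yes."""
    return p_yes * u_marry + (1.0 - p_yes) * u_single

def betsy_better_by_ratio(p, q, u_alice, u_betsy, u_single):
    """Ratio test from the text; assumes q > 0 and u_alice > u_single."""
    return p / q < (u_betsy - u_single) / (u_alice - u_single)

# Illustrative (assumed) utilities for the three consequences.
U_ALICE, U_BETSY, U_SINGLE = 4.0, 3.0, 0.0

results = []
for p, q in [(0.3, 0.5), (0.6, 0.3), (0.2, 0.4)]:
    direct = (expected_utility(p, U_ALICE, U_SINGLE)
              < expected_utility(q, U_BETSY, U_SINGLE))
    by_ratio = betsy_better_by_ratio(p, q, U_ALICE, U_BETSY, U_SINGLE)
    results.append((direct, by_ratio))
    print(p, q, direct, by_ratio)
```

In each case the two comparisons agree, so only the ratio $p/q$ matters for the choice once the utilities are fixed.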

States  of  the  World 

In  Savage’s  words  (1954,  p.  9)  the  world  is  “the  object  about  which  the 
person  is  concerned”  and  a  state  of  the  world  is  “a  description  of  the  world, 
leaving  no  relevant  aspect  undescribed.”  The  states  are  to  incorporate  all 
decision-relevant  factors  about  which  the  decision  maker  is  uncertain  and 
should  be  formulated  in  such  a  way  that  the  state  that  obtains  does  not 
depend  on  the  act  selected. 



According  to  the  last  part  of  this  description  it  would  not  seem  out  of 
place to call the elements in $S'$ “states.” However, the approach made
popular by Savage and others does not usually proceed in this way. Instead
of defining states as functions on acts to consequences, Savage defines acts
as functions on states to consequences. With $S$ the set of states of the world,
each $f \in F$ is a function on $S$ to $X$: $f(s)$ is the consequence that occurs if $f$ is
implemented and $s \in S$ obtains.

Simple  examples  of  states  as  they  are  often  thought  of  in  the  Savage 
approach  are:  whether  an  unbroken  egg  (the  world)  is  good  (state  1)  or 
rotten (state 2); whether the next flip of this coin will result in a head ($s_1$) or
a tail ($s_2$); whether the accused is guilty ($s_1$) or innocent ($s_2$); whether these
mushrooms are harmless ($s_1$) or poisonous ($s_2$). If $f$ = “Eat the mushrooms”
and $g$ = “Throw away the mushrooms” then $f(s_1)$ = “Enjoy a culinary
treat,” $f(s_2)$ = “Enjoy a culinary treat then die,” and $g(s_1) = g(s_2)$ =
“Throw away the bunch of mushrooms.”

If $S$ is so formulated that at most one $s \in S$ can obtain, the decision maker
cannot conceive of none of them obtaining, and the state that obtains does
not depend on the act selected, then we might suppose that the decision
maker has a probability measure $P^*$ on $S$ where $P^*(C)$ is his probability that
some $s \in C$ with $C \subseteq S$ obtains. We would then define $P_f$ by

$$P_f(A) = P^*(\{s : s \in S,\ f(s) \in A\}) \quad \text{for all } A \subseteq X. \tag{12.2}$$

If in fact subsets of $S$ are more or less probable depending on which $f \in F$ is
chosen, then new states defined as functions on $F$ into $S$ will remove this
difficulty.  In  most  discussions  based  on  Savage’s  theory  it  is  presumed  that 
(12.2)  holds. 
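
For finite $S$, (12.2) amounts to adding up the probabilities of the states that an act maps to each consequence. The sketch below uses the mushroom example; the function name `induced_measure` and the state probabilities 0.9 and 0.1 are assumptions made only for illustration.

```python
def induced_measure(act, p_star):
    """Return P_f as a dict x -> P*({s : f(s) = x}), following (12.2) for finite S."""
    p_f = {}
    for state, prob in p_star.items():
        consequence = act[state]
        p_f[consequence] = p_f.get(consequence, 0.0) + prob
    return p_f

# Hypothetical state probabilities for the mushroom example.
p_star = {"harmless": 0.9, "poisonous": 0.1}

# Acts as functions on states to consequences.
eat = {
    "harmless": "Enjoy a culinary treat",
    "poisonous": "Enjoy a culinary treat then die",
}
throw_away = {
    "harmless": "Throw away the bunch of mushrooms",
    "poisonous": "Throw away the bunch of mushrooms",
}

print(induced_measure(eat, p_star))
print(induced_measure(throw_away, p_star))
```

The constant act `throw_away` induces a measure that puts all probability on a single consequence, whatever $P^*$ is.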

Comparisons  of  Two  Formulations 

The  rest  of  this  book  is  primarily  concerned  with  utility  theory  based  on 
Savage’s  conception  of  decision  under  uncertainty.  Before  we  get  into  that 
it  seems  advisable  to  note  that  the  two  formulations  presented  above  are  not 
incompatible.  In  fact,  they  are  virtually  isomorphic  when  a  certain  consistent 
way  of  viewing  uncertainties  is  adopted.  A  demonstration  of  this  follows. 

Although $S'$ and $S$ as conceived of above appear different, at least on the
surface, suppose in fact that their probability measures $P'$ and $P^*$ agree with
each other. By this we mean that, for any $A \subseteq X$ and $f \in F$,

$$P'(\{s' : s' \in S',\ s'(f) \in A\}) = P^*(\{s : s \in S,\ f(s) \in A\}). \tag{12.3}$$

This says that the decision maker’s probability of getting an $x \in A$ when $f$ is
used  is  independent  of  the  particular  method  used  to  describe  his  uncertainty. 

Let $u$ on $X$ be the point utility function defined in such a way that for any
two measures $P$ and $Q$ on $X$, $P \prec Q \iff E(u, P) < E(u, Q)$. We assume (see
Chapter 10 if $X$ is infinite) that $u$ on $X$ is bounded. Let $u_1, u_2, \ldots$ be a sequence
of simple functions on $X$ that converges uniformly from below to $u$ (Definition
10.11). Consider one of these, say $u_n$. Let $u_n$ have $m$ values with $u_n(A_i) = c_i$
for $i = 1, \ldots, m$ where $\{A_1, \ldots, A_m\}$ is a partition of $X$ and let

$$C'_i = \{s' : s' \in S',\ s'(f) \in A_i\}, \qquad C_i = \{s : s \in S,\ f(s) \in A_i\}.$$

Then $\{C'_1, \ldots, C'_m\}$ and $\{C_1, \ldots, C_m\}$ are partitions of $S'$ and $S$ respectively
and, by (12.3), $\sum_i c_i P'(C'_i) = \sum_i c_i P^*(C_i)$. It follows from Definition 10.12
that

$$E[u(\mathbf{s}(f)), P'] = E[u(f(\mathbf{s})), P^*], \tag{12.4}$$

where $\mathbf{s}$ denotes the varying factor under $P'$ or $P^*$. In terms of (12.1) the
left side of (12.4) is $E(u, P_f)$. In terms of (12.2) the right side of (12.4) is
$E(u, P_f)$.

Hence,  under  the  agreement  of  (12.3),  the  two  formulations  give  the  same 
value for the expected utility of act $f$.
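
When all sets are finite the agreement can be seen by computing the expected utility of an act in two ways: summing over states with $P^*$, or summing over consequences with the induced $P_f$. The following sketch uses the marriage example; the state probabilities and the utilities are assumed values chosen only for illustration.

```python
import math

# Hypothetical probabilities for the four relevant states of the marriage example.
p_star = {
    "both yes": 0.2,
    "only Alice yes": 0.1,
    "only Betsy yes": 0.3,
    "both no": 0.4,
}
# Illustrative (assumed) utilities on the consequences.
u = {"Marry Alice": 4.0, "Marry Betsy": 3.0, "Stay Single": 0.0}

propose_to_alice = {
    "both yes": "Marry Alice", "only Alice yes": "Marry Alice",
    "only Betsy yes": "Stay Single", "both no": "Stay Single",
}
propose_to_betsy = {
    "both yes": "Marry Betsy", "only Alice yes": "Stay Single",
    "only Betsy yes": "Marry Betsy", "both no": "Stay Single",
}

def eu_over_states(act):
    """Sum of u(f(s)) P*(s) over the states s."""
    return sum(prob * u[act[s]] for s, prob in p_star.items())

def eu_over_consequences(act):
    """Sum of u(x) P_f(x) over consequences x, with P_f built from P*."""
    p_f = {}
    for s, prob in p_star.items():
        p_f[act[s]] = p_f.get(act[s], 0.0) + prob
    return sum(u[x] * px for x, px in p_f.items())

for act in (propose_to_alice, propose_to_betsy):
    print(eu_over_states(act), eu_over_consequences(act))
```

The two columns printed for each act coincide up to floating-point rounding, as the argument above leads one to expect.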

12.2  EXPECTED  UTILITY  PREVIEW 

In viewing acts as functions on states to consequences, we shall be concerned
with the expected-utility model

$$f \prec g \iff E[u(f(s)), P^*] < E[u(g(s)), P^*], \quad \text{for all } f, g \in F, \tag{12.5}$$

where $P^*$ is a probability measure on the set of all subsets of $S$ and $u$ is a
utility function on $X$.

A  number  of  axiomatizations  of  (12.5)  have  been  made.  By  far  the  best 
known  of  these  is  Savage’s  theory  (1954),  all  of  whose  axioms  can  be  stated 
in terms of $\prec$ on $F$. His axioms require, among other things, that $S$ be
infinite and that if $B \subseteq S$ and $0 \le p \le 1$ then $pP^*(B) = P^*(C)$ for some
$C \subseteq B$. He assumes also that every element in $X$ can occur under each state
and that all constant acts (those that assign the same consequence to every
state) are in $F$. His reason for doing this is to provide a way of defining
preferences  among  consequences  on  the  basis  of  preferences  among  (constant) 
acts.  Moreover,  this  enables  the  derivation  of  a  probability  measure  P*  on 
S.  Savage’s  theory  will  be  presented  in  detail  in  Chapter  14. 

One  of  the  most  criticized  aspects  of  Savage’s  theory  is  the  structural 
condition that all of $X$ is relevant under each $s \in S$. For the general situation
let $X(s)$ denote the subset of consequences that might actually occur under
the acts in $F$ if $s$ obtains. Then, viewing the consequences as complete
descriptions of what might occur, it would not seem unusual to have
$X(s) \cap X(s') = \emptyset$ when $s \ne s'$. When this is so, there is no natural way of
defining preferences on consequences in terms of preferences on acts. This
suggests an alternative approach to (12.5) that is based on a pair of preference
relations, $\prec$ on $F$ and $\prec'$ on $X$. In this approach we would be interested in
conditions for $\prec$ and $\prec'$ that imply the existence of a real-valued function $u$
on $X = \bigcup_s X(s)$ and a probability measure $P^*$ on $S$ that satisfy (12.5) along
with $x \prec' y \iff u(x) < u(y)$, for all $x, y \in X$. I do not presently know of any
axiomatization that does this, even when $X$ and $F$ are finite, and allows for
no overlap of the $X(s)$.

Extraneous  Probabilities 

In  addition  to  Savage’s  approach  to  (12.5),  a  number  of  authors  have 
developed  theories  that  use  a  set  of  extraneous  probabilities  in  the  axioms. 
These  probabilities  may  have  nothing  to  do  with  P*  which,  like  u,  is  to  be 
derived  from  the  axioms.  Conceptually,  the  extraneous  probabilities  can  be 
associated  with  the  outcomes  of  chance  devices  such  as  roulette  wheels,  dice, 
or pointers spun on circular disks. The axioms in these cases apply $\prec$ to a
set of elements constructed from $F$ and the extraneous probabilities. The
set to which $\prec$ is applied includes $F$ as a special subset.

Axioms  for  (12.5)  that  use  extraneous  probabilities  from  0  to  1  have  been 
presented by Chernoff (1954), Anscombe and Aumann (1963), Pratt, Raiffa,
and  Schlaifer  (1964),  Arrow  (1966),  and  Fishburn  (1969).  The  next  chapter 
examines  two  versions  of  this  theory.  The  first,  which  assumes  that  S  is 
finite,  follows  Pratt,  Raiffa,  and  Schlaifer  and  assumes  only  a  minimal 
overlap among the $X(s)$ for different $s \in S$. This overlap is necessary in order
to have a base on which to define $P^*$. The second theory makes no restrictions
on the sizes of $S$ and $X$, but it does assume that $X(s) = X$ for all $s$ as in
Savage’s  theory.  However,  unlike  Savage’s  theory,  almost  no  restrictions  are 
placed  on  P*. 

Axioms  for  (12.5)  that  use  only  the  extraneous  probability  1/2  (or  the 
notion  of  even-chance  gambles)  have  been  developed  by  Suppes  (1956). 
Suppes’  theory  can  be  viewed  as  a  logical  completion  of  Ramsey's  (1931) 
ideas.  Suppes  (1956)  should  be  consulted  for  a  more  detailed  account.  Some 
of  the  50-50  theory  is  presented  in  the  exercises  of  Chapter  13. 

Residual  Uncertainty  and  Act-State  Pairs 

In  practice  it  is  seldom  possible  to  ensure  that  the  states  will  leave  no 
relevant  aspect  of  the  world  undescribed.  No  matter  how  finely  we  describe 
the  potential  realizations  of  the  world,  the  descriptions  will  usually  be 
incomplete  even  when  the  states  meet  the  logical  criteria  of  being  mutually 
exclusive and collectively exhaustive. Thus the specification of act $f$ and state
$s$ will enable us to say something about what will occur although we may
never be precisely certain about exactly what will happen if $f$ is implemented
and $s$ obtains. Part of this residual uncertainty can be identified explicitly by
expanding $S$ to obtain a finer set of states. This may necessitate an expansion
of $F$ also.




The  practical  question  is  thus  seen  as  the  question  of  how  detailed  to  make 
the  states  in  light  of  the  purpose  of  the  decision  and  the  import  of  the 
potential  consequences. 

The possibility of residual uncertainty (given $f$ and $s$ we are still not
precisely certain about what will happen) leads us to consider a formulation
that does not attempt to detail exact consequences. In this formulation
consequences $f(s)$ are replaced by act-state pairs $(f, s) \in F \times S$. Uncertainties
not  resolved  by  simply  specifying  act-state  pairs  might  be  mentally  factored 
into  the  situation  by  the  decision  maker  during  his  preference  deliberations. 

In this case no act-state pair appears under more than one state. Thus we
have the kind of situation described above where $X(s) \cap X(s') = \emptyset$. With
$u$ a utility function on $F \times S$ in the present formulation, we might ask for
conditions for a binary relation $\prec$ on $F$ and a binary relation $\prec'$ on $F \times S$
that imply the existence of a real-valued function $u$ on $F \times S$ and a probability
measure $P^*$ on $S$ such that

$$(f, s) \prec' (g, t) \iff u(f, s) < u(g, t), \quad \text{for all } (f, s), (g, t) \in F \times S, \tag{12.6}$$

$$f \prec g \iff E[u(f, s), P^*] < E[u(g, s), P^*], \quad \text{for all } f, g \in F. \tag{12.7}$$

I  do  not  presently  know  of  any  more-or-less  satisfactory  axiomatization  for 
this  model. 

12.3  MODELS  WITHOUT  STATE  PROBABILITIES 

Despite  the  absence  of  axioms  for  the  F  x  S  model  of  (12.6)  and  (12.7) 
we  can  formulate  axioms  for  more  general  but  perhaps  somewhat  less 
interesting  forms  of  that  model.  These  forms  posit  additivity  over  the  states 
but  make  no  attempt  to  define  state  probabilities.  I  shall  comment  briefly 
on several of them. These comments apply also to the consequence formulation
when $X(s) \cap X(s') = \emptyset$ whenever $s \ne s'$. Throughout this section both
$F$ and $S$ are assumed to be finite.

An  Order  for  Each  State 

In our first case we assume that the decision maker has a weak order $\prec$
on $F$ along with a weak order $\prec_s$ on $F$ for each $s$. Thus, $\prec_s$ orders $F$ under
the hypothesis that $s$ obtains. If the decision maker does in fact have a weak
order $\prec'$ on $F \times S$, then $\prec_s$ would be obtained as the restriction of $\prec'$ to
$F \times \{s\}$. In this context we are interested in the existence of a real-valued
function $v$ on $F \times S$ that satisfies

$$f \prec_s g \iff v(f, s) < v(g, s), \quad \text{for all } f, g \in F \text{ and } s \in S, \tag{12.8}$$

$$f \prec g \iff \sum_s v(f, s) < \sum_s v(g, s), \quad \text{for all } f, g \in F. \tag{12.9}$$


Under the weak order conditions, an independence axiom across states that
is necessary and sufficient for (12.8) and (12.9) can be derived from the
Theorem of The Alternative (Theorem 4.2). One version of such an axiom is:
if $f^j \preceq g^j$ (i.e., $f^j \prec g^j$ or $f^j \sim g^j$) for $j = 1, \ldots, m$ and if for each $s$ there
is a permutation $f^{s1}, \ldots, f^{sm}$ of $f^1, \ldots, f^m$ such that $g^j \preceq_s f^{sj}$ for $j =
1, \ldots, m$, then in fact $f^j \sim g^j$ and $g^j \sim_s f^{sj}$ for all $j$ and $s$.

Apart  from  the  question  of  intransitive  indifference,  one  can  criticize  the 
model  of  (12.8)  and  (12.9)  on  the  count  that  the  decision  maker  might  have 
a nonindifferent weak order $\prec_s$ when he regards $s$ as virtually impossible.
In (12.7) we can take care of this by setting $P^*(s) = 0$, but the only way of
reflecting “$s$ is impossible” in (12.9) in a general way is to have $v(f, s)$ constant
on $F$ for the given $s$, and if (12.8) is to hold we then require $(f, s) \sim' (g, s)$
for all $f, g \in F$. The model given by (12.8) and (12.9) can easily be
amended  to  handle  this  criticism  by  excluding  all  states  that,  in  the  judgment 
of  the  decision  maker,  cannot  possibly  obtain.  Such  states  will  be  referred 
to  as  null  states  in  the  next  two  chapters. 

The  model  and  therefore  the  independence  condition  can  easily  be  seen  to 
be  unreasonable  when  the  “state”  that  obtains  depends  on  the  selected  act. 
For  example,  suppose  you  want  to  sell  something  and  can  either  advertise 
(at  some  cost)  or  not  advertise.  Let  the  “states”  be  s  =  item  is  sold,  t  =  item 
is not sold. Then surely advertise $\prec_s$ don’t advertise, and advertise $\prec_t$ don’t
advertise. According to the model this requires that advertise $\prec$
don’t  advertise,  which,  if  we  took  it  seriously,  would  say  that  one  should 
never  advertise. 
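
The difficulty, and the way act-independent states remove it, can be illustrated with a small computation. This is a sketch under stated assumptions: the sale price of 10, the advertising cost of 2, and the use of net money payoffs in place of the $v$ values are all hypothetical. The refined states are functions on acts to {sold, not sold}, in the spirit of the reformulation mentioned in Section 12.1.

```python
from itertools import product

ACTS = ("advertise", "don't advertise")
PRICE, AD_COST = 10, 2  # hypothetical money amounts

def payoff(act, sold):
    """Net payoff when `act` is chosen and the item is sold (True) or not (False)."""
    return (PRICE if sold else 0) - (AD_COST if act == "advertise" else 0)

def dominated_statewise(act, other, states, outcome):
    """True if `act` is strictly worse than `other` under every state."""
    return all(outcome(act, s) < outcome(other, s) for s in states)

# Act-dependent "states": the item is sold, or it is not sold.
naive_states = (True, False)
naive = dominated_statewise("advertise", "don't advertise", naive_states, payoff)

# Act-independent states: pairs (sold if advertise, sold if don't advertise).
refined_states = list(product((True, False), repeat=2))

def refined_payoff(act, state):
    """Payoff under a refined state, which fixes the sale outcome for each act."""
    return payoff(act, state[ACTS.index(act)])

refined = dominated_statewise(
    "advertise", "don't advertise", refined_states, refined_payoff
)

print(naive, refined)  # True False
```

With the act-dependent “states” advertising is worse under each one, so any additive model must rank it lower. Under the refined state “sold only if advertised” advertising is better, so no such conclusion is forced.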

Perfect  Information  Acts 

An alternative to using the $\prec_s$ directly is to work only with $\prec$ on a set
that includes $F$. For example, let $\mathcal{F}$ be the set of functions on $S$ to $F$: a
function $\mathbf{f} \in \mathcal{F}$, which assigns act $\mathbf{f}(s)$ to state $s$ for each $s \in S$, is a perfect
information act. We interpret $\mathbf{f}$ as follows. Suppose the decision maker
specifies an $\mathbf{f} \in \mathcal{F}$. He then gives this function to an imaginary second party
who has perfect information about which state obtains and who proceeds to
implement the $\mathbf{f}(s) \in F$ for the $s$ that obtains. In terms of the $\prec_s$ the decision
maker’s most preferred $\mathbf{f}$ would presumably be one that for each $s$ has
$f \preceq_s \mathbf{f}(s)$ for all $f \in F$. (This assumes that the state that obtains does not
depend on the selected act.) $F$ is the subset of constant functions in $\mathcal{F}$.

Under  this  formulation  we  are  interested  in  the  existence  of  a  real-valued 
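function $v$ on $F \times S$ that satisfies

$$\mathbf{f} \prec \mathbf{g} \iff \sum_s v(\mathbf{f}(s), s) < \sum_s v(\mathbf{g}(s), s), \quad \text{for all } \mathbf{f}, \mathbf{g} \in \mathcal{F}. \tag{12.10}$$

Condition C of Theorem 4.1 applies directly to this case. That is, (12.10)
holds if and only if [$\mathbf{f}^1(s), \ldots, \mathbf{f}^m(s)$ is a permutation of $\mathbf{g}^1(s), \ldots, \mathbf{g}^m(s)$
for each $s \in S$, $\mathbf{f}^j \preceq \mathbf{g}^j$ for $j = 1, \ldots, m - 1$] $\Rightarrow$ not $\mathbf{f}^m \prec \mathbf{g}^m$.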

A  probabilistic  argument  (using  extraneous  probabilities)  that  supports 
this independence condition proceeds as follows. Suppose $\mathbf{f}^1(s), \ldots, \mathbf{f}^m(s)$
is a permutation of $\mathbf{g}^1(s), \ldots, \mathbf{g}^m(s)$ for each $s$. Let $\sum (1/m)\mathbf{f}^j$ denote an
“alternative” whose “implementation” is carried out as follows. A well-balanced
die with $m$ symmetric faces numbered 1 through $m$ is rolled and if
face $j$ occurs then $\mathbf{f}^j$ is used, with $(\mathbf{f}^j(s), s)$ the resulting act-state pair if $s$
obtains. $\sum (1/m)\mathbf{g}^j$ has a similar interpretation. Supposing for convenience
that all $\mathbf{f}^j(s)$ are different for $j = 1, \ldots, m$, if $s$ obtains then $\sum (1/m)\mathbf{f}^j$ gives
each of $(\mathbf{f}^1(s), s), \ldots, (\mathbf{f}^m(s), s)$ an equal chance of resulting. The same is
true with respect to $\sum (1/m)\mathbf{g}^j$, and since $\mathbf{g}^1(s), \ldots, \mathbf{g}^m(s)$ is a permutation of
$\mathbf{f}^1(s), \ldots, \mathbf{f}^m(s)$ it seems natural to regard $\sum (1/m)\mathbf{f}^j$ and $\sum (1/m)\mathbf{g}^j$ as
essentially equivalent if $s$ obtains. Since this is true for each $s$ we would
expect that $\sum (1/m)\mathbf{f}^j \sim \sum (1/m)\mathbf{g}^j$.

Now if in fact the condition is violated by $\mathbf{f}^j \preceq \mathbf{g}^j$ for all $j < m$ and $\mathbf{f}^m \prec \mathbf{g}^m$
we would then expect that $\sum (1/m)\mathbf{f}^j \prec \sum (1/m)\mathbf{g}^j$, which violates our
“reasonable” conclusion that $\sum (1/m)\mathbf{f}^j \sim \sum (1/m)\mathbf{g}^j$.
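
The necessity of the permutation condition for (12.10) rests on a simple fact: when the $\mathbf{g}^j(s)$ are a permutation of the $\mathbf{f}^j(s)$ under every state, the additive totals summed over $j$ are equal for any $v$. The sketch below checks this numerically; the act and state labels, the random integer $v$, and the helper `total` are all hypothetical.

```python
import random

def total(v, info_act):
    """Additive value of a perfect information act: sum over s of v(f(s), s)."""
    return sum(v[(act, s)] for s, act in info_act.items())

random.seed(0)
acts, states, m = ["a", "b", "c"], ["s", "t"], 3

# Hypothetical integer-valued v on F x S.
v = {(a, s): random.randint(0, 9) for a in acts for s in states}

# m arbitrary perfect information acts f^1, ..., f^m.
fs = [{s: random.choice(acts) for s in states} for _ in range(m)]

# Build g^1, ..., g^m by permuting the f^j(s) separately within each state.
gs = [dict() for _ in range(m)]
for s in states:
    column = [f[s] for f in fs]
    random.shuffle(column)
    for g, act in zip(gs, column):
        g[s] = act

sum_f = sum(total(v, f) for f in fs)
sum_g = sum(total(v, g) for g in gs)
print(sum_f, sum_g)  # the two totals are equal
```

Since the totals agree, it cannot happen that every $\mathbf{f}^j$ has a total no larger than that of $\mathbf{g}^j$ with one strictly smaller, which is what a violation of the condition would require under (12.10).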

Extraneous  Probabilities 

The model given by (12.10) can be embedded in a model that uses extraneous
probabilities. In particular, let $\mathcal{P}$ be the set of (simple) probability measures
on $\mathcal{F}$. A pseudo-operational interpretation for $P \in \mathcal{P}$ is that, using $P$,
an $\mathbf{f} \in \mathcal{F}$ is determined: then, if $s$ obtains, $(\mathbf{f}(s), s)$ is the resulting act-state
pair. In this formulation the axioms of Theorem 11.1, letting $X_s = F \times \{s\}$,
lead to

$$P \prec Q \iff \sum_s E[v(f, s), P_s] < \sum_s E[v(f, s), Q_s], \tag{12.11}$$

in which $P_s$ is the marginal of $P$ on $F \times \{s\}$ and the $v(\cdot, s)$ for the $s \in S$ are
unique up to similar positive linear transformations. (12.10) follows from
(12.11) when we define $\mathbf{f} \prec \mathbf{g} \iff P \prec Q$ when $P(\mathbf{f}) = Q(\mathbf{g}) = 1$.


12.4  SUMMARY 

The  usual  states  of  the  world  formulation  views  the  acts  in  F  as  functions 
on  states  to  consequences.  The  states  represent  a  partition  of  the  potential 
realizations  of  the  world,  ideally  leaving  no  relevant  aspect  undescribed.  It 
is  usually  assumed  that  the  state  of  the  world  that  obtains  does  not  depend  on 
the  act  selected  by  the  decision  maker.  If  this  is  not  true,  new  states  that 
satisfy  the  independence  criterion  can  be  defined  as  functions  on  acts  to  the 
initial set of states. This reformulation is similar to the definition of states
as functions on acts to consequences, as suggested in this chapter in connection
with the acts-consequences model of Part II. Under a fundamental
agreement  between  the  usual  states  model  and  the  Part  II  model,  the  two 
models  are  seen  to  be  alternative  but  equivalent  ways  to  characterize  decision 
under  uncertainty  with  an  expected-utility  model. 

In  cases  where  acts  and  states  are  formulated  but  exact  consequences  may 
not  be  detailed,  independence  axioms  over  the  states  lead  to  additive  utility 
models  that  do  not  explicitly  include  probabilities  for  the  states. 


INDEX  TO  EXERCISES 

1. Conditional consequence probabilities. 2. Equivalence of two approaches. 3. Job-changing
example. 4. Psychology of timing. 5. Independence axiom. 6. Win-lose example
and state probabilities. 7-8. Penalty kick example. 9. Propose to the other girl. 10. Theorem
of The Alternative for (12.8)-(12.9).


Exercises 

1. Use (12.1) and (10.5) to write the probability of “$f$ will result in an $x \in A$,
given that $g$ will result in an $x \in A'$” in terms of $P'$. Then use (12.2) to write the
probability in terms of $P^*$.

2. With all sets finite the utility of act $f$ in the Part II approach can be written
as $\sum_x u(x)P_f(x)$, and as $\sum_s u(f(s))P^*(s)$ in the states of the world model. Assuming
that $P_f(x) = P^*(\{s : f(s) = x\})$, show that $\sum_s u(f(s))P^*(s) = \sum_x u(x)P_f(x)$.

3.  A  man  currently  making  $10000  per  year  has  been  offered  $14000  per  year 
by  another  company.  He  decides  to  give  his  company  notice  that  he  will  quit  unless 
he  gets  a  new  salary  of  $x.  He  decides  to  make  x  either  13000,  14000,  or  15000. 
The  higher  x  is,  the  more  likely  his  company  will  be  to  reject  his  ultimatum:  if  they 
reject, he will take the new job at $14000. Formulate his decision under the Part II
approach.  Then  reformulate  it  in  the  states  of  the  world  manner  so  that  the  state 
that  obtains  doesn’t  depend  on  the  selected  x. 

4. Suppose that you have to make a choice between f and g when the pay-off
from your choice will depend on the outcome of one flip of a slightly bent coin that
you have been shown. Furthermore, you can either (1) select f or g, after which the
coin is flipped by a referee or (2) have the referee make the flip before you choose
f or g, but be informed of the outcome of the flip only after you have made your
choice. Assuming that you believe the referee is thoroughly honest, do you feel that
the procedure (1) or (2) that you select will in any way affect your decision between
f and g? Explain the reason(s) for your answer.




5.  Adapted  from  Ellsberg  (1961)  and  Raiffa  (1961).  An  urn  contains  one  white 
ball (W) and two other balls. You know only that the two other balls are either
both red (R), or both green (G), or one is red and one is green. Consider the two
situations shown below where W, R, and G represent the three “states” (which
don’t depend on the act selected) according to whether one ball drawn at random
is white, red, or green. The dollar figures are what you will be paid after you make
your choice and a ball is drawn.

          W       R       G                 W       R       G
    f    $100     $0      $0          f'   $100     $0     $100
    g     $0     $100     $0          g'    $0     $100    $100

a. Which of f and g do you prefer? Which of f' and g' do you prefer?

b. Show that the pair (g ≺ f, f' ≺ g') violates the following independence
axiom: if {f_1(s), f_2(s)} = {g_1(s), g_2(s)} for each s ∈ S and if f_1 ≺ g_1 then not
f_2 ≺ g_2. Use an argument like that following (12.10) to argue the “inconsistency”
of (g ≺ f, f' ≺ g').

c. If your answers in (a) were g ≺ f and f' ≺ g', does (b) convince you that there
is something “wrong” with your preferences? Discuss this.

6. Suppose a decision maker can choose one of two strategies, $f$ and $g$, and his
“opponent” can independently choose one of two strategies, $f'$ and $g'$. Our decision
maker is concerned only with the two consequences “win” and “lose.” He believes
that either might occur for each of the four strategy pairs in $\{f, g\} \times \{f', g'\}$. Eight
states, displayed along the top of Figure 12.1, can be used to partition his “world.”
Each state specifies the strategy chosen by his opponent and, for each of $f$ and $g$,
specifies whether he will win or lose.

a. Does the state that obtains depend on the one of $f$ and $g$ that is chosen by
our decision maker?

b. Suppose an additive expected-utility model without state probabilities, similar
to that described by (12.11), is used as a basis for estimating the $v$ numbers in
the matrix of Figure 12.1. According to this, $g$ is the better act since 3 + 1 <
2 + 3. In the usual states model, characterized by (12.5), we would have
$\sum_{i=1}^{8} u(f(s_i))P^*(s_i)$ as the expected utility for $f$ and $\sum_{i=1}^{8} u(g(s_i))P^*(s_i)$ as the
expected utility for $g$ with $f(s_i), g(s_i) \in \{\text{win}, \text{lose}\}$ for each $i$. Assuming that
these two models agree with one another we should have $P^*(s_2) = 3a$,
$P^*(s_3) = 2a$, $P^*(s_6) = a$, and $P^*(s_7) = 3a$, where $a > 0$. Explain why this is
so. What can you tell about $P^*(\{s_1, s_4, s_5, s_8\})$ from the data? Is there any need
to estimate $P^*(s_4)$, $P^*(s_5)$, and $P^*(s_8)$ when the usual states model is
applied?

                  s_1    s_2    s_3    s_4    s_5    s_6    s_7    s_8
    Opponent      f'     f'     f'     f'     g'     g'     g'     g'
    f:            win    win    lose   lose   win    win    lose   lose
    g:            win    lose   win    lose   win    lose   win    lose

    v(f, s_i)     0      3      0      0      0      1      0      0
    v(g, s_i)     0      0      2      0      0      0      3      0

Figure 12.1 A v matrix.

7.  In  soccer,  a  direct  penalty  kick  inside  the  box  can  be  viewed  as  a  two-person 
game  between  the  kicker  and  the  opposing  goalkeeper.  The  goalkeeper  can  select 
one  of  three  acts: 

f = stand firm until the kick is made;
g = move right an instant before the ball is kicked;
h = move left an instant before the ball is kicked.

Assume that the kicker will aim right or left (from goalkeeper’s orientation).
Assuming several symmetries, the goalkeeper’s probabilities are presented in Figure
12.2: β is the probability a goal will not be scored if he moves right and the kick is
right. Surely β > α > γ.

                            f                   g                   h
                      kick     kick       kick     kick       kick     kick
                      right    left       right    left       right    left
    Goal prevented     α        α          β        γ          γ        β
    Goal scored       1 − α    1 − α      1 − β    1 − γ      1 − γ    1 − β

Figure 12.2 Conditional probabilities of consequences.

a. If the goalkeeper considers a right kick and a left kick equally likely (which
may of course be false), show that f is best if 2α > β + γ and that either g
or h is best if β + γ > 2α.

b. Reformulate this in the typical states model with acts f, g, and h and 16
appropriate states.

8. (Continuation.) Suppose in the preceding example we use only the gross
states $s$ = kick right and $t$ = kick left and that an estimate of $v$ on $\{f, g, h\} \times \{s, t\}$
in accord with (12.11) gives $v(f, s) = 2$, $v(g, s) = 6$, $v(h, s) = 0$, and $v(f, t) = 3$,
$v(g, t) = 0$, $v(h, t) = 9$. According to this, which act is most preferred? Describe
the best perfect information act. Do the $v$’s suggest that the goalkeeper considers
$s$ more probable than $t$? Why?

9.  Suppose  an  extraneous  probability  model  like  that  described  by  (12.11)  gives 
the  following  v  matrix  on  F  x  S  for  the  marriage  example  of  Section  12.1 : 


                                        States of the World
                     $s_1$             $s_2$                  $s_3$                  $s_4$
                     (both "yes")    (only Alice "yes")   (only Betsy "yes")   (both "no")
Propose to Alice     1               2                    0                    0
Propose to Betsy     0               0                    4                    0


a. Which girl would the young man rather marry?

b.  Which  girl  should  he  propose  to?  (Which  act  is  preferred?) 

c. Suppose that extraneous probabilities are used to scale the young man's
utilities on the three consequences, after the theory in Chapter 8, and that
$u$(Stay Single) = 0, $u$(Marry Betsy) = 3, $u$(Marry Alice) = 4. (That is,
“Marry  Betsy”  is  indifferent  to  a  gamble  with  probability  .75  for  “Marry 
Alice”  and  probability  .25  for  “Stay  Single.”)  Argue  that  this  data  along  with 
the  figures  in  the  above  matrix  suggest  that  P*  (Betsy  would  say  “yes”)  = 
(14/9)  P*  (Alice  would  say  “yes”). 

10.  Use  the  Theorem  of  The  Alternative  to  verify  that  the  independence  condition 
following  (12.9)  along  with  weak  order  is  sufficient  for  (12.8)-(12.9). 


Chapter  13 


AXIOMS  WITH  EXTRANEOUS 
PROBABILITIES 


This  chapter  gives  two  derivations  of  the  expected-utility  model 

$$f \prec g \Leftrightarrow E[u(f(s)), P^*] < E[u(g(s)), P^*], \quad \text{for all } f, g \in F, \qquad (13.1)$$

that  are  based  on  extraneous  probabilities  as  described  in  Section  12.2.  The 
first  derivation  assumes  that  S  is  finite  and  presupposes  a  minimal  overlap 
of  the  relevant  consequences  under  each  state  in  S.  The  second  makes  no 
restriction  on  the  size  of  S  but  assumes  that  all  consequences  are  relevant 
under  each  state.  Both  P*  and  u  are  derived  from  the  axioms.  Section  13.4 
shows  how  these  axioms  might  apply  to  the  decision  model  of  Part  II. 

All  probability  measures  in  this  chapter  are  defined  on  the  set  of  all  subsets 
of  their  basic  set.  “P*  on  S"  is  an  abbreviation  for  “P*  on  the  set  of  all 
subsets  of  S." 

13.1 HORSE LOTTERIES

The  purpose  of  this  section  is  to  define  many  of  the  terms  used  later  in  this 
chapter  and  to  prove  a  theorem  for  horse  lotteries,  which  are  the  elements  on 
which  <  is  applied  in  our  axioms. 

Throughout the chapter $S$ is the set of states of the world. Subsets of $S$,
called events, will be denoted by $\{s\}, A, B, C, \ldots$. A partition of $S$ is a set of
nonempty events that are mutually exclusive and whose union equals $S$. $A^c$ is
the complement of $A$ in $S$: $A^c = \{s : s \notin A,\ s \in S\}$. $\{A, A^c\}$ is a two-part
partition of $S$ provided that $\emptyset \subset A \subset S$.

$F$ is the set of acts. Each $f \in F$ is a function on $S$ into $X$, the set of
consequences. $X(s) = \{f(s) : f \in F\}$, the set of consequences under state $s$. $X =
\bigcup_s X(s)$. $\mathcal{P}(s)$ is the set of simple probability measures (extraneous) on
$X(s)$. $\mathcal{P}$ is the set of simple probability measures on $X$.

The phrase "horse lottery" was introduced by Anscombe and Aumann
(1963). A horse lottery is a function on $S$ that assigns a $P \in \mathcal{P}(s)$ to each $s \in S$.




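$\mathcal{H}$ is the set of all horse lotteries. Horse lotteries are denoted in boldface
($\mathbf{P}, \mathbf{Q}, \ldots$). We adopt the following pseudo-operational interpretation
for $\mathbf{P} \in \mathcal{H}$. If $\mathbf{P}$ is selected and $s \in S$ obtains then $\mathbf{P}(s) \in \mathcal{P}(s)$ is used to
determine a resultant consequence in $X(s)$.

If $\mathbf{P}, \mathbf{Q} \in \mathcal{H}$ and $0 \le \alpha \le 1$ then $\alpha\mathbf{P} + (1 - \alpha)\mathbf{Q}$ is the horse lottery in $\mathcal{H}$
that assigns $\alpha\mathbf{P}(s) + (1 - \alpha)\mathbf{Q}(s) \in \mathcal{P}(s)$ to $s$, for each $s \in S$. Under this
interpretation, $\mathcal{H}$ is a mixture set (Definition 8.3).

Taking $\prec$ on $\mathcal{H}$ as the basic binary relation, $\mathbf{P} \sim \mathbf{Q} \Leftrightarrow$ (not $\mathbf{P} \prec \mathbf{Q}$, not
$\mathbf{Q} \prec \mathbf{P}$), and $\mathbf{P} \preceq \mathbf{Q} \Leftrightarrow$ ($\mathbf{P} \prec \mathbf{Q}$ or $\mathbf{P} \sim \mathbf{Q}$). Event $A$ is null $\Leftrightarrow$ $\mathbf{P} \sim \mathbf{Q}$
whenever $\mathbf{P}(s) = \mathbf{Q}(s)$ for every $s \in A^c$. State $s$ is null $\Leftrightarrow$ $\{s\}$ is null.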

The  following  theorem  is  similar  to  Theorem  11.1. 

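THEOREM 13.1. Suppose that $S$ is finite and that the following hold for all
$\mathbf{P}, \mathbf{Q}, \mathbf{R} \in \mathcal{H}$:

A1. $\prec$ on $\mathcal{H}$ is a weak order;

A2. $(\mathbf{P} \prec \mathbf{Q},\ 0 < \alpha < 1) \Rightarrow \alpha\mathbf{P} + (1 - \alpha)\mathbf{R} \prec \alpha\mathbf{Q} + (1 - \alpha)\mathbf{R}$;

A3. $(\mathbf{P} \prec \mathbf{Q},\ \mathbf{Q} \prec \mathbf{R}) \Rightarrow \alpha\mathbf{P} + (1 - \alpha)\mathbf{R} \prec \mathbf{Q}$ and $\mathbf{Q} \prec \beta\mathbf{P} + (1 - \beta)\mathbf{R}$
for some $\alpha, \beta \in (0, 1)$.

Then, with $S = \{s_1, \ldots, s_n\}$, there are real-valued
functions $u_1, \ldots, u_n$ on $X(s_1), \ldots, X(s_n)$ respectively such that

$$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_{i=1}^{n} E(u_i, \mathbf{P}(s_i)) < \sum_{i=1}^{n} E(u_i, \mathbf{Q}(s_i)), \quad \text{for all } \mathbf{P}, \mathbf{Q} \in \mathcal{H}, \qquad (13.2)$$

and the $u_i$ that satisfy (13.2) are unique up to similar positive linear
transformations, with $u_i$ constant on $X(s_i)$ if and only if $s_i$ is null.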

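Proof. By Theorem 8.4, there is a real-valued function $u$ on $\mathcal{H}$
that satisfies $\mathbf{P} \prec \mathbf{Q} \Leftrightarrow u(\mathbf{P}) < u(\mathbf{Q})$ and $u(\alpha\mathbf{P} + (1 - \alpha)\mathbf{Q}) = \alpha u(\mathbf{P}) +
(1 - \alpha)u(\mathbf{Q})$, and $u$ is unique up to a positive linear transformation when it
satisfies these properties. For convenience, we shall write $\mathbf{Q} = (\mathbf{Q}(s_1), \ldots,
\mathbf{Q}(s_n)) = (Q_1, \ldots, Q_n)$, with $Q_i \in \mathcal{P}(s_i)$ for each $i$.

Fix $\mathbf{R} = (R_1, \ldots, R_n)$ in $\mathcal{H}$ and let $\mathbf{P}_i = (R_1, \ldots, R_{i-1}, P_i, R_{i+1}, \ldots, R_n)$.
Then, with $\mathbf{P} = (P_1, \ldots, P_n)$, $(1/n)\mathbf{P} + ((n - 1)/n)\mathbf{R} = \sum_{i=1}^{n} (1/n)\mathbf{P}_i$. Therefore
$u(\mathbf{P}) + (n - 1)u(\mathbf{R}) = \sum_{i=1}^{n} u(\mathbf{P}_i)$. Defining

$$u_i(P_i) = u(\mathbf{P}_i) - ((n - 1)/n)\,u(\mathbf{R}),$$

summation over $i$ gives $\sum_{i=1}^{n} u_i(P_i) = \sum_{i=1}^{n} u(\mathbf{P}_i) - (n - 1)u(\mathbf{R})$, so that
$u(\mathbf{P}) = \sum_{i=1}^{n} u_i(P_i)$.

Let $\mathbf{Q}_i = (R_1, \ldots, R_{i-1}, Q_i, R_{i+1}, \ldots, R_n)$. Then, by the preceding
result, $u(\alpha\mathbf{P}_i + (1 - \alpha)\mathbf{Q}_i) = u_i(\alpha P_i + (1 - \alpha)Q_i) + \sum_{j \ne i} u_j(R_j)$. In addition,

$$u(\alpha\mathbf{P}_i + (1 - \alpha)\mathbf{Q}_i) = \alpha u(\mathbf{P}_i) + (1 - \alpha)u(\mathbf{Q}_i) = \alpha u_i(P_i) + (1 - \alpha)u_i(Q_i) + \sum_{j \ne i} u_j(R_j),$$

so that $u_i(\alpha P_i + (1 - \alpha)Q_i) = \alpha u_i(P_i) + (1 - \alpha)u_i(Q_i)$. Since the elements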



in $\mathcal{P}(s_i)$ are simple measures, $u_i(P_i) = E(u_i, P_i)$ and (13.2) follows with
$u_i(x) = u_i(P_i)$ when $P_i(x) = 1$.

Uniqueness up to similar positive linear transformations follows readily
from the uniqueness property for $u$. If $v_i$ satisfy (13.2) along with the $u_i$ then,
with $v(\mathbf{P}) = \sum_{i=1}^{n} E(v_i, P_i)$, $v = au + b$ and $a > 0$. Holding $P_j$ fixed for all
$j \ne i$, it then follows that $v_i = au_i + b_i$. This holds for each $i$.

Clearly, $u_i$ is constant on $X(s_i)$ if and only if $s_i$ is null. ♦
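
The averaging identity at the heart of this proof can be checked numerically. The following is a minimal sketch in Python, assuming a hypothetical two-state example: the state utilities `U`, the lotteries `P` and `R`, and the helper names `expect`, `value`, and `mix` are illustrative and do not come from the text. It builds the lotteries that agree with `R` except in one coordinate and confirms that the additive value satisfies u(P) + (n - 1)u(R) = sum of u(P_i), and that it is linear in mixtures.

```python
from fractions import Fraction as Fr

# Hypothetical example data: two states with state-dependent utilities u_i on X(s_i).
U = [{"x": Fr(0), "y": Fr(3)}, {"y": Fr(1), "z": Fr(5)}]

def expect(u, p):
    """E(u, P) for a simple probability measure P given as {consequence: prob}."""
    return sum(prob * u[x] for x, prob in p.items())

def value(lottery):
    """Additive value of a horse lottery (one simple measure per state), as in (13.2)."""
    return sum(expect(u, p) for u, p in zip(U, lottery))

def mix(a, p, q):
    """State-by-state mixture aP + (1 - a)Q of two horse lotteries."""
    out = []
    for ps, qs in zip(p, q):
        keys = set(ps) | set(qs)
        out.append({x: a * ps.get(x, Fr(0)) + (1 - a) * qs.get(x, Fr(0)) for x in keys})
    return out

P = [{"x": Fr(1, 2), "y": Fr(1, 2)}, {"z": Fr(1)}]
R = [{"y": Fr(1)}, {"y": Fr(1, 4), "z": Fr(3, 4)}]
n = len(U)

# P_i agrees with R except in coordinate i, where it uses P(s_i).
P_i = [[P[j] if j == i else R[j] for j in range(n)] for i in range(n)]

lhs = value(P) + (n - 1) * value(R)
rhs = sum(value(pi) for pi in P_i)
mixture_ok = value(mix(Fr(1, 3), P, R)) == Fr(1, 3) * value(P) + Fr(2, 3) * value(R)
```

Exact rational arithmetic is used so that the two sides can be compared with equality rather than a tolerance.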

13.2  FINITE  STATES  THEORY 

In order to derive $u$ on $X = \bigcup_s X(s)$ and $P^*$ on $S$ on the basis of Theorem
13.1 when $S$ is finite, two more axioms will be used. The first of these (A4)
assumes that two consequences $x_*$ and $x^*$ appear in every $X(s)$ and that they
are not indifferent. Hence $\{x_*, x^*\} \subseteq X(s) \cap X(t)$ for $s, t \in S$. With a
convenient abuse of rigor we shall say that a simple probability measure $P$ that
assigns probability 1 to $X(s) \cap X(t)$ is in both $\mathcal{P}(s)$ and $\mathcal{P}(t)$, and write
$P \in \mathcal{P}(s) \cap \mathcal{P}(t)$.

The second new axiom (A5) is a monotonicity axiom. It says that if $s$ and $t$
are not null then there is the same order under both states for all $P \in \mathcal{P}(s) \cap
\mathcal{P}(t)$. In other words, preferences on consequences or simple probability
measures on consequences that can occur under different states shall not be
state dependent.

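THEOREM 13.2. Suppose that the hypotheses of Theorem 13.1 hold and
that, in addition,

A4. There are $x_*, x^* \in X(s)$ for every $s \in S$ such that $\mathbf{P} \prec \mathbf{Q}$ when $\mathbf{P}(s)$ [$\mathbf{Q}(s)$]
assigns probability 1 to $x_*$ [$x^*$] for each $s \in S$;

A5. If $s, t \in S$, $s$ and $t$ are not null, $P, Q \in \mathcal{P}(s) \cap \mathcal{P}(t)$, and if $\mathbf{P} \in \mathcal{H}$, then
($\mathbf{P}$ with $\mathbf{P}(s)$ replaced by $P$) $\prec$ ($\mathbf{P}$ with $\mathbf{P}(s)$ replaced by $Q$) $\Leftrightarrow$ ($\mathbf{P}$ with $\mathbf{P}(t)$
replaced by $P$) $\prec$ ($\mathbf{P}$ with $\mathbf{P}(t)$ replaced by $Q$).

Then there is a real-valued function $u$ on $X$ and a probability measure $P^*$
on $S$ such that

$$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow E[E(u, \mathbf{P}(s)), P^*] < E[E(u, \mathbf{Q}(s)), P^*], \quad \text{for all } \mathbf{P}, \mathbf{Q} \in \mathcal{H}, \qquad (13.3)$$

with $P^*(s) = 0$ if and only if $s$ is null. In addition, if $v$ on $X$ and a probability
measure $Q^*$ on $S$ satisfy (13.3) along with $u$ and $P^*$ then $P^* = Q^*$ and there
are numbers $a > 0$ and $b$ such that $v(x) = au(x) + b$ for all $x \in \bigcup_{\{s:\ s \text{ not null}\}}
X(s)$.

If we define $\prec$ on $F$ from $\prec$ on $\mathcal{H}$ by $f \prec g \Leftrightarrow \mathbf{P} \prec \mathbf{Q}$ when $\mathbf{P}(s)$ [$\mathbf{Q}(s)$]
assigns probability 1 to $f(s)$ [$g(s)$] for each $s \in S$, (13.1) follows immediately
from (13.3).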




Proof. Let $S = \{s_1, \ldots, s_n\}$. Beginning with the results of Theorem 13.1,
A4 implies that $I = \{i : s_i \in S$ and $s_i$ is not null$\}$ is not empty. If (13.3) is to
hold then $P^*(s_i)u$ must be a positive linear transformation of $u_i$, and hence
$P^*(s) = 0 \Leftrightarrow s$ is null.

If $I = \{i\}$, (13.3) follows from (13.2) on setting $P^*(s_i) = 1$ and $u(x) = u_i(x)$
for all $x \in X(s_i)$. $u$ on the rest of $X$ is arbitrary. Clearly, $u$ on $X(s_i)$ is unique
up to a positive linear transformation.

If $I$ has more than one element let $\mathcal{P}_{ij}$ denote the set of simple probability
measures on $X_{ij} = X(s_i) \cap X(s_j)$ when $i, j \in I$. With a convenient lapse in
rigor, take $P \in \mathcal{P}(s_i)$ and $P \in \mathcal{P}(s_j)$ when $P \in \mathcal{P}_{ij}$. By (13.2) and A5, $E(u_i, P) <
E(u_i, Q) \Leftrightarrow E(u_j, P) < E(u_j, Q)$, for all $P, Q \in \mathcal{P}_{ij}$. Then, by A4 and the
latter part of Theorem 8.2, there is a unique $r_{ij} > 0$ (invariant under similar
positive linear transformations of $u_i$ and $u_j$) such that

$$u_i(x) - u_i(x_*) = r_{ij}[u_j(x) - u_j(x_*)] \quad \text{for all } x \in X_{ij}. \qquad (13.4)$$

Fix $t \in I$ and define $P^*$ and $u$ by

$$P^*(s_i) = r_{it} \Big/ \sum_{j \in I} r_{jt} \quad \text{for all } i \in I$$

$$P^*(s_i) = 0 \quad \text{for all } i \notin I$$

$$u(x) = [u_i(x) - u_i(x_*)]/P^*(s_i) \quad \text{when } x \in X(s_i) \text{ and } i \in I$$

and $u(x) = 0$ when $x \notin \bigcup_I X(s_i)$. To show that $u$ is well defined we need to
prove that

$$u_i(x) - u_i(x_*) = \frac{r_{it}}{r_{jt}}[u_j(x) - u_j(x_*)] \quad \text{when } i, j \in I \text{ and } x \in X_{ij}. \qquad (13.5)$$

By (13.4), $r_{it}/r_{jt} = ([u_i(x^*) - u_i(x_*)]/[u_t(x^*) - u_t(x_*)]) / ([u_j(x^*) - u_j(x_*)]/
[u_t(x^*) - u_t(x_*)]) = [u_i(x^*) - u_i(x_*)]/[u_j(x^*) - u_j(x_*)] = r_{ij}$, so that (13.5)
follows from (13.4) for $i, j$. Substitution for $u_i$ into (13.2) then yields (13.3).

It follows easily from $u(x_*) < u(x^*)$ and the uniqueness assertions of
Theorem 13.1 that $P^*$ is unique and $u$ on $\bigcup_I X(s_i)$ is unique up to a positive
linear transformation. ♦
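
The construction of $P^*$ and $u$ from the state utilities can be traced on a small case. The sketch below is an illustration only, assuming hypothetical utilities `u_state` on two states that share the consequences `x_lo` and `x_hi` (standing in for $x_*$ and $x^*$); the function name `recover` is likewise an assumption. It computes the ratios of (13.4) against a reference state, normalizes them into state probabilities, and rescales each state utility.

```python
from fractions import Fraction as Fr

# Hypothetical u_i on X(s_i); both states contain x_lo (for x_*) and x_hi (for x^*).
u_state = {
    "s1": {"x_lo": Fr(0), "x_hi": Fr(2), "a": Fr(1)},
    "s2": {"x_lo": Fr(1), "x_hi": Fr(7), "b": Fr(4)},
}

def recover(u_state, t):
    """Return (P*, u) built as in the proof, using t as the fixed reference state."""
    span = {s: u["x_hi"] - u["x_lo"] for s, u in u_state.items()}
    nonnull = [s for s in u_state if span[s] != 0]      # index set I
    r = {s: span[s] / span[t] for s in nonnull}         # r_it, from (13.4)
    total = sum(r.values())
    p_star = {s: (r[s] / total if s in r else Fr(0)) for s in u_state}
    u = {}
    for s in nonnull:
        for x, val in u_state[s].items():
            scaled = (val - u_state[s]["x_lo"]) / p_star[s]
            # (13.5) is what guarantees a shared consequence gets a single value.
            assert u.setdefault(x, scaled) == scaled
    return p_star, u

p_star, u = recover(u_state, "s1")
```

In this example the utility ranges are 2 and 6, so the recovered probabilities are 1/4 and 3/4, and $P^*(s_i)u(x)$ reproduces $u_i(x) - u_i(x_*)$ in each state.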

13.3  HOMOGENEOUS  HORSE  LOTTERY  THEORY 

The  definitions  of  Section  13.1  apply  to  this  section. 

When $S$ is infinite, the horse-lottery approach meets serious mathematical
difficulties if we assume only a minimal overlap of the $X(s)$. Therefore, we
shall assume throughout this section that $X = X(s)$ for all $s \in S$. Some
additional definitions follow.




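$\mathbf{P}$ is constant on event $A$ $\Leftrightarrow$ $\mathbf{P}(s) = \mathbf{P}(t)$ for all $s, t \in A$. When $\mathbf{P}(s) = P$
(in $\mathcal{P}$) for all $s \in A$, we shall say that $\mathbf{P} = P$ on $A$. $\mathbf{P} = \mathbf{Q}$ on $A$ $\Leftrightarrow$ $\mathbf{P}(s) = \mathbf{Q}(s)$
for each $s \in A$. Thus $A$ is null $\Leftrightarrow$ $\mathbf{P} \sim \mathbf{Q}$ whenever $\mathbf{P} = \mathbf{Q}$ on $A^c$.

With $P, Q \in \mathcal{P}$ we define $\prec$ on $\mathcal{P}$ on the basis of $\prec$ on $\mathcal{H}$ thus: $P \prec Q \Leftrightarrow
\mathbf{P} \prec \mathbf{Q}$ when $\mathbf{P} = P$ and $\mathbf{Q} = Q$ on $S$. Also, $P \prec \mathbf{Q} \Leftrightarrow \mathbf{P} \prec \mathbf{Q}$ when
$\mathbf{P} = P$ on $S$. $P \preceq \mathbf{Q}$, $P \sim \mathbf{Q}$, $\mathbf{P} \preceq Q$, ... are defined in similar fashion.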

After  stating  our  main  theorem  we  shall  prove  it  by  proving  a  series  of 
lemmas. 

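THEOREM 13.3. Suppose that the following axioms hold for all $\mathbf{P}, \mathbf{Q}, \mathbf{R} \in
\mathcal{H}$:

B1. $\prec$ on $\mathcal{H}$ is a weak order;

B2. $(\mathbf{P} \prec \mathbf{Q},\ 0 < \alpha < 1) \Rightarrow \alpha\mathbf{P} + (1 - \alpha)\mathbf{R} \prec \alpha\mathbf{Q} + (1 - \alpha)\mathbf{R}$;

B3. $(\mathbf{P} \prec \mathbf{Q},\ \mathbf{Q} \prec \mathbf{R}) \Rightarrow \alpha\mathbf{P} + (1 - \alpha)\mathbf{R} \prec \mathbf{Q} \prec \beta\mathbf{P} + (1 - \beta)\mathbf{R}$ for some
$\alpha, \beta \in (0, 1)$;

B4. $P \prec Q$ for some $P, Q \in \mathcal{P}$;

B5. (Event $A$ is not null, $\mathbf{P} = P$ and $\mathbf{Q} = Q$ on $A$, $\mathbf{P} = \mathbf{Q}$ on $A^c$) $\Rightarrow$
($\mathbf{P} \prec \mathbf{Q} \Leftrightarrow P \prec Q$);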

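B6. $\mathbf{P}(s) \prec \mathbf{R}$ for all $s \in S \Rightarrow \mathbf{P} \preceq \mathbf{R}$. $\mathbf{R} \prec \mathbf{Q}(s)$ for all $s \in S \Rightarrow \mathbf{R} \preceq \mathbf{Q}$.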

Then there is a real-valued function $u$ on $X$ and a probability measure $P^*$
on $S$ that satisfy (13.3). Moreover, when (13.3) holds for $u$ and $P^*$,

C1. Every $\mathbf{P} \in \mathcal{H}$ is bounded. That is, given $\mathbf{P} \in \mathcal{H}$, there are real numbers
$a$ and $b$ such that $P^*(\{s : a \le E(u, \mathbf{P}(s)) \le b\}) = 1$;

C2. for all $A \subseteq S$, $P^*(A) = 0 \Leftrightarrow A$ is null;

C3. $u$ is bounded if there is a denumerable partition of $S$ such that
$P^*(A) > 0$ for every event in the partition;

C4. A real-valued function $u'$ on $X$ and a probability measure $Q^*$ on $S$
satisfy (13.3) in place of $u$ and $P^*$ if and only if $Q^* = P^*$ and $u'$ is a positive
linear transformation of $u$.

B4 says that there is some pair of constant horse lotteries that are not
indifferent. B4 and $X = X(s)$ for all $s \in S$ supplant A4 of Theorem 13.2.

B5 is an obvious monotonicity axiom for nonnull events. B6 is a form of
sure-thing or dominance axiom. It is similar to axiom A4a in Section 10.4
and to P7 in the next chapter. B6 is needed only if $S$ is infinite.

In addition to the noted conclusions of Theorem 13.3 it should be remarked
that $P^*$ has no special properties apart from those of a probability measure.
If $S$ is infinite, it may or may not be true that $P^*(A) = 1$ for some finite
$A \subseteq S$. If $P^*(A) < 1$ for every finite $A \subset S$ it may or may not be true that $S$
can be partitioned into an arbitrary finite number of events each with equal
probability. In addition, $u$ has no special properties other than those noted



in  C3  and  C4,  except  that  it  cannot  be  constant  on  X.  u  might  be 
unbounded  when  the  condition  of  C3  does  not  hold. 

Proof  of  Theorem  13.3 

To prove the theorem we shall prove a series of statements that, taken
together, establish all conclusions of the theorem. For convenience we first
list these statements. $u$ and $P^*$ for S2-S5 are defined as in S1.

S1. B1-B5 $\Rightarrow$ (13.3) for all $\mathbf{P}, \mathbf{Q} \in \mathcal{H}_0$ where $\mathcal{H}_0 = \{\mathbf{P} : \mathbf{P} \in \mathcal{H}$ and $\mathbf{P}$ is
constant on each event in some finite partition of $S\}$. $u$ and $P^*$ for (13.3) on $\mathcal{H}_0$
are unique up to a positive linear transformation and unique, respectively, and
$P^*(A) = 0$ if and only if $A$ is null.

S2. B1-B6 $\Rightarrow$ (13.3) for all bounded horse lotteries. (See C1.)

S3. (B1-B6, there is a denumerable partition of $S$ such that $P^*(A) > 0$ for
every $A$ in the partition) $\Rightarrow$ $u$ on $X$ is bounded.

S4. If for each positive integer $n$ there is an $n$-event partition of $S$ for which
each event has positive probability under $P^*$, then there is a denumerable
partition of $S$ such that $P^*(A) > 0$ for every $A$ in the partition.

S5. If the hypotheses of S4 are false then B1-B6 imply that all horse
lotteries are bounded. (In this case it is not necessarily true that $P^*$ is a simple
probability measure. See Exercises 5 and 6.)

Note that S3, S4, and S5 imply that all horse lotteries are bounded. If the
hypotheses of S4 are true then, by S3, $u$ on $X$ is bounded and hence all $\mathbf{P} \in \mathcal{H}$
must be bounded. On the other hand, if the hypotheses of S4 are false then,
by S5, all $\mathbf{P} \in \mathcal{H}$ are bounded even though $u$ on $X$ might be unbounded.

Proof of S1. Let $\{B_1, \ldots, B_n\}$ be a finite partition of $S$. Then, by
essentially the same proof used for Theorem 13.2, there are nonnegative numbers
$P^*_B(B_1), \ldots, P^*_B(B_n)$ that sum to one and there is a real-valued function $u_B$
on $X$ such that, whenever $\mathbf{P} = P_i$ on $B_i$ and $\mathbf{Q} = Q_i$ on $B_i$ ($i = 1, \ldots, n$),

$$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_{i=1}^{n} P^*_B(B_i)E(u_B, P_i) < \sum_{i=1}^{n} P^*_B(B_i)E(u_B, Q_i), \qquad (13.6)$$

and when this holds $P^*_B(B_i) = 0$ if and only if $B_i$ is null, $P^*_B$ is unique, and $u_B$
is unique up to a positive linear transformation.

Let $\mathcal{H}_c$ be the set of all constant horse lotteries in $\mathcal{H}$. Thus $\mathcal{H}_c \subseteq \mathcal{H}_0$. If
$P, Q \in \mathcal{P}$ and if $\{B_1, \ldots, B_n\}$ and $\{C_1, \ldots, C_m\}$ are partitions of $S$ then, by
(13.6), $E(u_B, P) < E(u_B, Q) \Leftrightarrow E(u_C, P) < E(u_C, Q)$. Hence, noting that
$\mathcal{H}_c$ is a mixture set for which B1, B2, and B3 hold, it follows from Theorem
8.4 that $u_C$ on $X$ is a positive linear transformation of $u_B$ on $X$. Therefore,
we can drop the partition-specific subscript on $u$ and have, in place of (13.6),

$$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_{i=1}^{n} P^*_B(B_i)E(u, P_i) < \sum_{i=1}^{n} P^*_B(B_i)E(u, Q_i), \qquad (13.7)$$

in which $u$ is unique up to a positive linear transformation.




For event $A \subseteq S$ let $\mathcal{H}_A = \{\mathbf{P} : \mathbf{P} \in \mathcal{H}$ and $\mathbf{P}$ is constant on $A$ and on $A^c\}$.
If $\{B_1, \ldots, B_n\}$ and $\{C_1, \ldots, C_m\}$ are partitions of $S$ and if $A$ is an element
in each partition then (13.7) implies that, for all $\mathbf{P}, \mathbf{Q} \in \mathcal{H}_A$, with $\mathbf{P} = P_A$
and $\mathbf{Q} = Q_A$ on $A$, and $\mathbf{P} = P^c_A$ and $\mathbf{Q} = Q^c_A$ on $A^c$,

$$P^*_B(A)E(u, P_A) + P^*_B(A^c)E(u, P^c_A) < P^*_B(A)E(u, Q_A) + P^*_B(A^c)E(u, Q^c_A)$$

$$\Leftrightarrow P^*_C(A)E(u, P_A) + P^*_C(A^c)E(u, P^c_A) < P^*_C(A)E(u, Q_A) + P^*_C(A^c)E(u, Q^c_A).$$

It then follows from the version of (13.7) for the partition $\{A, A^c\}$ that
$P^*_B(A) = P^*_C(A)$, so that we can drop the partition-specific subscript on $P^*$
and write (13.7) as

$$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_{i=1}^{n} P^*(B_i)E(u, P_i) < \sum_{i=1}^{n} P^*(B_i)E(u, Q_i). \qquad (13.8)$$

Adding $P^*(\emptyset) = 0$ to complete $P^*$, it follows that $P^*$ is uniquely determined
and that $P^*(A) = 0$ if and only if $A$ is null. Finite additivity for $P^*$ is easily
demonstrated using partitions $\{A, B, (A \cup B)^c\}$ and $\{A \cup B, (A \cup B)^c\}$
with $A \cap B = \emptyset$ in an analysis like that leading to (13.8).

Finally, to obtain (13.3) for all of $\mathcal{H}_0$, let $\mathbf{P} = P_i$ on $B_i$ and $\mathbf{Q} = Q_j$ on $C_j$
for partitions $\{B_1, \ldots, B_n\}$ and $\{C_1, \ldots, C_m\}$. Applying (13.8) to the
partition $\{B_i \cap C_j : i = 1, \ldots, n;\ j = 1, \ldots, m;\ B_i \cap C_j \ne \emptyset\}$, we get
$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_i \sum_j P^*(B_i \cap C_j)E(u, P_i) < \sum_i \sum_j P^*(B_i \cap C_j)E(u, Q_j)$. By
finite additivity, the last expression is $\sum_i P^*(B_i)E(u, P_i) < \sum_j P^*(C_j)E(u, Q_j)$.

♦
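
The last step, passing to the common refinement of two partitions and collapsing the double sum by finite additivity, can be illustrated as follows. This is a sketch under assumed data: the four-point state set, the point masses `mass`, the partitions `B` and `C`, and the expected utilities `EuP` and `EuQ` are all hypothetical.

```python
from fractions import Fraction as Fr

# Hypothetical finite S with point masses; any finitely additive P* would serve.
mass = {1: Fr(1, 10), 2: Fr(2, 10), 3: Fr(3, 10), 4: Fr(4, 10)}

def prob(event):
    """P*(event) as a sum of point masses."""
    return sum(mass[s] for s in event)

B = [frozenset({1, 2}), frozenset({3, 4})]               # P is constant on each B_i
C = [frozenset({1}), frozenset({2, 3}), frozenset({4})]  # Q is constant on each C_j
EuP = [Fr(5), Fr(1)]          # hypothetical values of E(u, P_i)
EuQ = [Fr(0), Fr(2), Fr(6)]   # hypothetical values of E(u, Q_j)

# Common refinement: the nonempty intersections B_i & C_j.
cells = [(i, j, b & c) for i, b in enumerate(B) for j, c in enumerate(C) if b & c]

refined_P = sum(prob(cell) * EuP[i] for i, j, cell in cells)
refined_Q = sum(prob(cell) * EuQ[j] for i, j, cell in cells)
direct_P = sum(prob(b) * EuP[i] for i, b in enumerate(B))
direct_Q = sum(prob(c) * EuQ[j] for j, c in enumerate(C))
```

Both lotteries are constant on every cell of the refinement, which is why (13.8) applies to it, and additivity then returns each sum to its own coarser partition.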

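Proof of S2. Since $\mathcal{H}$ is a mixture set, Theorem 8.4 implies that there is a
real-valued function $v$ on $\mathcal{H}$ that satisfies $\mathbf{P} \prec \mathbf{Q} \Leftrightarrow v(\mathbf{P}) < v(\mathbf{Q})$ and
$v(\alpha\mathbf{P} + (1 - \alpha)\mathbf{Q}) = \alpha v(\mathbf{P}) + (1 - \alpha)v(\mathbf{Q})$. Since these expressions hold on
$\mathcal{H}_0 \subseteq \mathcal{H}$ it follows from (13.3) for $\mathcal{H}_0$ and Theorem 8.4 that $w$ on $\mathcal{H}_0$
defined by $w(\mathbf{P}) = E[E(u, \mathbf{P}(s)), P^*]$, is a positive linear transformation of
the restriction of $v$ on $\mathcal{H}_0$. Without loss in generality we can therefore specify
that

$$v(\mathbf{P}) = E[E(u, \mathbf{P}(s)), P^*] \qquad (13.9)$$

for all $\mathbf{P} \in \mathcal{H}_0$, with

$$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow v(\mathbf{P}) < v(\mathbf{Q}), \quad \text{for all } \mathbf{P}, \mathbf{Q} \in \mathcal{H}, \qquad (13.10)$$

$$v(\alpha\mathbf{P} + (1 - \alpha)\mathbf{Q}) = \alpha v(\mathbf{P}) + (1 - \alpha)v(\mathbf{Q}), \quad \text{for all } (\alpha, \mathbf{P}, \mathbf{Q}) \in [0, 1] \times \mathcal{H}^2. \qquad (13.11)$$

According to (13.10), S2 is proved if (13.9) holds for all bounded $\mathbf{P} \in \mathcal{H}$.
Our first step toward this end will be to show that

$$c = \inf\{E(u, \mathbf{P}(s)) : s \in A\} \le v(\mathbf{P}) \le \sup\{E(u, \mathbf{P}(s)) : s \in A\} = d \qquad (13.12)$$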




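holds when $P^*(A) = 1$ and $c$ and $d$ as defined are finite. Let $\mathbf{Q} = \mathbf{P}$ on $A$ and
$c \le E(u, \mathbf{Q}(s)) \le d$ on $A^c$. Since $A^c$ is null, $\mathbf{Q} \sim \mathbf{P}$ and $v(\mathbf{P}) = v(\mathbf{Q})$ by
(13.10). To verify $c \le v(\mathbf{Q}) \le d$ with $c$ and $d$ finite, suppose to the contrary
that $d < v(\mathbf{Q})$. With $c \le E(u, Q') \le d$ and $\mathbf{Q}' = Q'$ on $S$, let $\mathbf{R} = \alpha\mathbf{Q} +
(1 - \alpha)\mathbf{Q}'$ with $\alpha < 1$ near enough to one so that $d < v(\mathbf{R}) = \alpha v(\mathbf{Q}) +
(1 - \alpha)v(\mathbf{Q}') < v(\mathbf{Q})$. Then $\mathbf{R} \prec \mathbf{Q}$ by (13.10). But since $E(u, \mathbf{Q}(s)) \le
d < v(\mathbf{R})$, it follows from (13.10) that $\mathbf{Q}(s) \prec \mathbf{R}$ for all $s \in S$ and hence by
B6 that $\mathbf{Q} \preceq \mathbf{R}$, a contradiction. Hence $v(\mathbf{Q}) \le d$. By a symmetric proof,
$c \le v(\mathbf{Q})$.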

With $\mathbf{P}$ bounded let $A$ with $P^*(A) = 1$ be an event on which $E(u, \mathbf{P}(s))$ is
bounded and define $c$ and $d$ as in (13.12). If $c = d$ then (13.9) is immediate.
Henceforth assume that $c < d$: for convenience we shall take $c = 0$, $d = 1$.
Let $\mathbf{Q}$ be defined as in the preceding paragraph so that $v(\mathbf{P}) = v(\mathbf{Q})$ and
$E[E(u, \mathbf{P}(s)), P^*] = E[E(u, \mathbf{Q}(s)), P^*]$, the latter by Exercise 10.22. To show
that $v(\mathbf{Q}) = E[E(u, \mathbf{Q}(s)), P^*]$, let $\{A_1, \ldots, A_n\}$ be the partition (ignoring
empty sets) of $S$ defined by

$$A_1 = \{s : 0 \le E(u, \mathbf{Q}(s)) \le 1/n\}$$

$$A_i = \{s : (i - 1)/n < E(u, \mathbf{Q}(s)) \le i/n\}, \quad i = 2, \ldots, n,$$

and let $P_i \in \mathcal{P}$ be such that

$$(i - 1)/n \le E(u, P_i) \le i/n \quad \text{for } i = 1, \ldots, n. \qquad (13.13)$$

The existence of such $P_i$ is guaranteed by (13.12). Let $\mathbf{P}_i = \mathbf{Q}$ on $A_i$ and
$\mathbf{P}_i = P_i$ on $A_i^c$ ($i = 1, \ldots, n$); let $\mathbf{P}_0 = \sum_{i=1}^{n} (1/n)\mathbf{P}_i$; and let $\mathbf{R} =
\sum_{j \ne i} (1/(n - 1))P_j$ on $A_i$ for $i = 1, \ldots, n$. Then $\mathbf{P}_0(s) = \sum_j (1/n)\mathbf{P}_j(s) =
(1/n)\mathbf{Q}(s) + ((n - 1)/n)\sum_{j \ne i} (1/(n - 1))P_j$ when $s \in A_i$, so that $\mathbf{P}_0 =
(1/n)\mathbf{Q} + ((n - 1)/n)\mathbf{R}$. Hence, by (13.11) and $\mathbf{P}_0 = \sum_i (1/n)\mathbf{P}_i$,

$$v(\mathbf{Q}) = \sum_{i=1}^{n} v(\mathbf{P}_i) - (n - 1)v(\mathbf{R}). \qquad (13.14)$$

Since $\mathbf{R} \in \mathcal{H}_0$, (13.9) implies that $v(\mathbf{R}) = \sum_i E(u, \sum_{j \ne i} (1/(n - 1))P_j)P^*(A_i) =
(1/(n - 1)) \sum_i [\sum_{j \ne i} E(u, P_j)]P^*(A_i)$. Substituting this in (13.14) gives

$$v(\mathbf{Q}) = \sum_{i=1}^{n} v(\mathbf{P}_i) - \sum_{i=1}^{n} \sum_{j \ne i} E(u, P_j)P^*(A_i). \qquad (13.15)$$

Now by (13.13), (13.12), and the definition of $\mathbf{P}_i$,

$$(i - 1)/n \le v(\mathbf{P}_i) \le i/n \quad \text{for } i = 1, \ldots, n. \qquad (13.16)$$

Since $0 = \inf\{E(u, \mathbf{Q}(s)) : s \in S\}$ and $1 = \sup\{E(u, \mathbf{Q}(s)) : s \in S\}$, $P_i$ that
satisfy (13.13) can be selected so that either

$$E(u, P_1) = 1/n, \text{ and } E(u, P_i) = (i - 1)/n \text{ for } i > 1 \qquad (13.17)$$

or

$$E(u, P_i) = i/n \text{ for } i < n, \text{ and } E(u, P_n) = (n - 1)/n. \qquad (13.18)$$



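Applying (13.17) and the left side of (13.16) to (13.15), we get

$$v(\mathbf{Q}) \ge \sum_{i=1}^{n} \frac{i - 1}{n} - \sum_{i=1}^{n} \left[\frac{n - 1}{2} + \frac{1}{n} - E(u, P_i)\right] P^*(A_i)$$

$$= \frac{n - 1}{2} - \frac{n - 1}{2} + \sum_{i=2}^{n} \frac{i - 1}{n} P^*(A_i) - \frac{1}{n}[1 - P^*(A_1)]$$

$$\ge \sum_{i=1}^{n} \frac{i - 1}{n} P^*(A_i) - \frac{1}{n}.$$

Applying (13.18) and the right side of (13.16) to (13.15), we get

$$v(\mathbf{Q}) \le \sum_{i=1}^{n} \frac{i}{n} - \sum_{i=1}^{n} \left[\frac{n - 1}{2} + \frac{n - 1}{n} - E(u, P_i)\right] P^*(A_i)$$

$$= \sum_{i=1}^{n} \frac{i}{n} P^*(A_i) + \frac{1}{n}[1 - P^*(A_n)]$$

$$\le \sum_{i=1}^{n} \frac{i}{n} P^*(A_i) + \frac{1}{n}.$$

By the definition of $E$, $\sum_i ((i - 1)/n)P^*(A_i) \le E[E(u, \mathbf{Q}(s)), P^*] \le
\sum_i (i/n)P^*(A_i)$, so that $|v(\mathbf{Q}) - E[E(u, \mathbf{Q}(s)), P^*]| \le 2/n$ for $n = 1, 2, \ldots$.
Hence, $v(\mathbf{Q}) = E[E(u, \mathbf{Q}(s)), P^*]$. ♦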
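
The squeeze used at the end of this proof rests on the fact that the lower and upper step sums over the partition $\{A_1, \ldots, A_n\}$ bracket the expectation and differ by exactly $1/n$. The following sketch checks this on a hypothetical three-state example; the probabilities `p_star`, the values `values` (standing for $E(u, \mathbf{Q}(s))$, normalized to $[0, 1]$), and the helper names are assumptions made for illustration.

```python
from fractions import Fraction as Fr
import math

# Hypothetical finite example: P*(s) and the values E(u, Q(s)), scaled into [0, 1].
p_star = [Fr(1, 4), Fr(1, 4), Fr(1, 2)]
values = [Fr(0), Fr(3, 10), Fr(1)]

def cell(x, n):
    """Index i with x in A_i, where A_1 = [0, 1/n] and A_i = ((i-1)/n, i/n] for i >= 2."""
    return max(1, math.ceil(x * n))

def bounds(n):
    """Lower and upper step sums: sum of (i-1)/n P*(A_i) and sum of i/n P*(A_i)."""
    lower = sum(p * Fr(cell(x, n) - 1, n) for p, x in zip(p_star, values))
    upper = sum(p * Fr(cell(x, n), n) for p, x in zip(p_star, values))
    return lower, upper

expected = sum(p * x for p, x in zip(p_star, values))
```

Since $v(\mathbf{Q})$ lies within $1/n$ of these step sums on either side, the gap of $1/n$ between them gives the bound $2/n$ stated above.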

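Proof of S3. Let $\mathcal{A}$ be a denumerable partition of $S$ with $P^*(A) > 0$ for
all $A \in \mathcal{A}$. $\{P^*(A) : A \in \mathcal{A}\}$ must have a largest element, say $P^*(A_1)$. Then
$\{P^*(A) : A \in \mathcal{A} - \{A_1\}\}$ must have a largest element, say $P^*(A_2)$. Continuing this,
we get a sequence $A_1, A_2, \ldots$ with $\{A_1, A_2, \ldots\} = \mathcal{A}$ and $P^*(A_i) \ge P^*(A_{i+1})$
for $i = 1, 2, \ldots$.

Contrary to S3 suppose that $u$ is unbounded above. By a linear
transformation of $u$ we can assume that $[0, \infty) \subseteq \{E(u, P) : P \in \mathcal{P}\}$. Let $P_i \in \mathcal{P}$ be
such that

$$E(u, P_i) = 1/P^*(A_i) \quad \text{for } i = 1, 2, \ldots. \qquad (13.19)$$

Let $\mathbf{P} = P_i$ on $A_i$ ($i = 1, 2, \ldots$) and let $\mathbf{Q}_n$ be constant on each $A_i$ for $i \le n$
and constant on $\bigcup_{i=n+1}^{\infty} A_i$ with

$$E(u, \mathbf{Q}_n(s)) = P^*(A_n)^{-1} - P^*(A_i)^{-1} \quad \text{for all } s \in A_i;\ i = 1, \ldots, n$$

$$E(u, \mathbf{Q}_n(s)) = 0 \quad \text{for all } s \in \bigcup_{i=n+1}^{\infty} A_i. \qquad (13.20)$$

Let $v$ on $\mathcal{H}$ satisfy (13.10) and (13.11) and also (13.9) on $\mathcal{H}_0$. Then

$$v(\mathbf{Q}_n) = \sum_{i=1}^{n} [P^*(A_n)^{-1} - P^*(A_i)^{-1}]P^*(A_i) = P^*(A_n)^{-1} \sum_{i=1}^{n} P^*(A_i) - n \quad \text{for } n = 1, 2, \ldots. \qquad (13.21)$$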




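By (13.19) and (13.20), $E(u, \tfrac{1}{2}\mathbf{P}(s) + \tfrac{1}{2}\mathbf{Q}_n(s)) = \tfrac{1}{2}P^*(A_i)^{-1} + \tfrac{1}{2}[P^*(A_n)^{-1} -
P^*(A_i)^{-1}] = \tfrac{1}{2}P^*(A_n)^{-1}$ for all $s \in \bigcup_{i=1}^{n} A_i$ and, by $P^*(A_i) \ge P^*(A_{i+1})$ and
(13.20), $E(u, \tfrac{1}{2}\mathbf{P}(s) + \tfrac{1}{2}\mathbf{Q}_n(s)) \ge \tfrac{1}{2}P^*(A_n)^{-1}$ for all $s \in \bigcup_{i=n+1}^{\infty} A_i$. Therefore
$\inf\{E(u, \tfrac{1}{2}\mathbf{P}(s) + \tfrac{1}{2}\mathbf{Q}_n(s)) : s \in S\} = \tfrac{1}{2}P^*(A_n)^{-1}$. Hence, by (13.12), $v(\tfrac{1}{2}\mathbf{P} +
\tfrac{1}{2}\mathbf{Q}_n) \ge \tfrac{1}{2}P^*(A_n)^{-1}$, which on using (13.11) and (13.21) implies that

$$v(\mathbf{P}) \ge P^*(A_n)^{-1} - P^*(A_n)^{-1} \sum_{i=1}^{n} P^*(A_i) + n \ge n \quad \text{for } n = 1, 2, \ldots.$$

But this requires $v(\mathbf{P})$ to be infinite. Hence $u$ is bounded above. A symmetric
proof shows that $u$ is bounded below under S3's hypotheses. ♦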
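
To see how the bound on $v(\mathbf{P})$ grows, the quantities in (13.21) and the final inequality can be evaluated for one concrete denumerable partition. The sketch below assumes, purely for illustration, that $P^*(A_i) = 2^{-i}$; the function names `p_star`, `v_Qn`, and `lower_bound_vP` are hypothetical.

```python
from fractions import Fraction as Fr

def p_star(i):
    """Hypothetical denumerable partition with P*(A_i) = 2**-i for i = 1, 2, ..."""
    return Fr(1, 2 ** i)

def v_Qn(n):
    """v(Q_n) from (13.21): sum over i <= n of [1/P*(A_n) - 1/P*(A_i)] P*(A_i)."""
    return sum((1 / p_star(n) - 1 / p_star(i)) * p_star(i) for i in range(1, n + 1))

def lower_bound_vP(n):
    """Bound on v(P) implied by v(P/2 + Q_n/2) >= 1/(2 P*(A_n)) together with (13.11)."""
    return 1 / p_star(n) - v_Qn(n)

bounds = [lower_bound_vP(n) for n in range(1, 11)]
```

With these probabilities the bound works out to $n + 1$, so it exceeds every $n$, which is the contradiction with a finite $v(\mathbf{P})$.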

Proof of S4. For each integer $n \ge 2$ let $\mathcal{A}_n$ be an $n$-part partition of $S$ each
event in which has positive probability. Define a new set of partitions
$\mathcal{B}_2, \mathcal{B}_3, \ldots$ recursively as follows:

$$\mathcal{B}_2 = \mathcal{A}_2$$

$$\mathcal{B}_n = \{A \cap B : A \in \mathcal{A}_n,\ B \in \mathcal{B}_{n-1},\ A \cap B \ne \emptyset\}, \quad n = 3, 4, \ldots.$$

It is easily verified that $\mathcal{B}_n$ contains at least $n$ events with positive probability
and that $\mathcal{B}_{n+1}$ is as fine as $\mathcal{B}_n$ so that $B \in \mathcal{B}_{n+1} \Rightarrow C \in \mathcal{B}_n$ for some $C$ that
includes $B$. For each $A \in \mathcal{B}_2$ let

$N^1_n(A)$ = number of events in $\mathcal{B}_n$ ($n > 2$) that are included in $A$ and have
positive probability.

With $\mathcal{B}_2 = \{A, A^c\}$ it follows that $N^1_n(A) + N^1_n(A^c) \ge n$ for $n = 3, 4, \ldots$.
Thus, as $n$ gets large, at least one of $N^1_n(A)$ and $N^1_n(A^c)$ approaches infinity.
Let $A_1$ be an event in $\mathcal{B}_2$ for which $N^1_n(A_1) \to \infty$ and let $B_1 = A_1^c$. Then
$P^*(B_1) > 0$ and $B_1$ will be the first element in our desired denumerable
partition.

Let $n(1)$ be an integer for which $\mathcal{B}_{n(1)}$ contains more than one subset of
$A_1$ that has positive probability. For each $A \subseteq A_1$ and $A \in \mathcal{B}_{n(1)}$ let

$N^2_n(A)$ = number of events in $\mathcal{B}_n$ ($n > n(1)$) that are included in $A$ and have
positive probability.

Let $\mathcal{A} = \{A : A \subseteq A_1,\ A \in \mathcal{B}_{n(1)}\}$. Then $\sum_{\mathcal{A}} N^2_n(A) = N^1_n(A_1)$ so that, since
$N^1_n(A_1) \to \infty$ as $n \to \infty$, at least one $N^2_n(A) \to \infty$ as $n \to \infty$. Let $A_2 \in \mathcal{A}$ be
such an $A$ and let $B_2 = A_1 \cap A_2^c$. Then $P^*(B_2) > 0$ and $\{B_1, B_2, A_2\}$ is a
partition of $S$ with $N^2_n(A_2) \to \infty$ as $n \to \infty$.

Continuing this construction in the obvious way gives a denumerable
sequence $B_1, B_2, B_3, \ldots$ of mutually disjoint events each of which has positive
probability. The conclusion of S4 follows. ♦

Proof of S5. If the hypotheses of S4 are false then there is a unique
positive integer $m$ for which there is an $m$-event partition of $S$ that has




positive probability for each event and such that no partition of $S$
has positive probability on more than $m$ of its events.

For convenience assume that $u(y) = 0$ for some $y \in X$. Suppose then,
contrary to the conclusion of S5, that $\mathbf{Q}$ is unbounded above. Let $\mathbf{P}$ be
obtained from $\mathbf{Q}$ by replacing each $x$ for which $\mathbf{Q}(s)(x) > 0$ and $u(x) < 0$ by
$y$ with $u(y) = 0$, for all $s \in S$. Then $E(u, \mathbf{P}(s)) \ge 0$ for all $s$ and $\mathbf{P}$ is unbounded
above. Then, for every $n > 0$,

$$P^*\{E(u, \mathbf{P}(s)) \ge n\} = P^*(\{s : E(u, \mathbf{P}(s)) \ge n\}) > 0.$$

By the preceding paragraph, $P^*\{E(u, \mathbf{P}(s)) \ge n\}$ can change no more than $m$
times as $n$ increases. Hence there is an $N$ and an $\alpha > 0$ such that

$$P^*\{E(u, \mathbf{P}(s)) \ge n\} = \alpha \quad \text{for all } n \ge N. \qquad (13.22)$$

Let $E(u, P_n) = n$ for $n = 1, 2, \ldots$, let $\mathbf{Q}_n = \mathbf{P}$ on $\{s : E(u, \mathbf{P}(s)) \ge n\}$,
$\mathbf{Q}_n = P_n$ on $\{s : E(u, \mathbf{P}(s)) < n\}$, and let $\mathbf{R}_n = P_n$ on $\{s : E(u, \mathbf{P}(s)) \ge n\}$,
$\mathbf{R}_n = \mathbf{P}$ on $\{s : E(u, \mathbf{P}(s)) < n\}$. Then, with $\mathbf{P}_n = P_n$ on $S$, $\tfrac{1}{2}\mathbf{P} + \tfrac{1}{2}\mathbf{P}_n =
\tfrac{1}{2}\mathbf{Q}_n + \tfrac{1}{2}\mathbf{R}_n$, so that with $v$ on $\mathcal{H}$ as given by (13.10) and (13.11) and satisfying
(13.9) for all bounded horse lotteries,

$$v(\mathbf{P}) + n = v(\mathbf{Q}_n) + v(\mathbf{R}_n), \quad n = 1, 2, \ldots. \qquad (13.23)$$

Since $\mathbf{R}_n$ is bounded, (13.9) and (13.22) give $v(\mathbf{R}_n) = E[E(u, \mathbf{R}_n(s)), P^*] \ge n\alpha$
for all $n \ge N$. Since $P_{n-1} \prec \mathbf{Q}_n(s)$ for all $s \in S$, B6 implies that $P_{n-1} \preceq \mathbf{Q}_n$
so that $v(\mathbf{Q}_n) \ge n - 1$ for all $n$. Then, using (13.23),

$$v(\mathbf{P}) \ge n\alpha - 1 \quad \text{for all } n \ge N,$$

which contradicts the finiteness of $v(\mathbf{P})$. Hence $\mathbf{Q}$ is not unbounded above. A
symmetric proof shows that $\mathbf{Q}$ is bounded below. ♦


13.4  THE  PART  II  DECISION  MODEL 

Beginning with the set $F$ of acts and the set $X$ of consequences as in the
Part II approach, let $S$ be the set of all functions on $F$ to $X$ (see $S'$ in Section
12.1). Then the subset of $X$ that is immediately relevant under "state" $s \in S$
is $X(s) = \{s(f) : f \in F\}$. For many $s \in S$, $X(s)$ will be a proper subset of $X$,
and for each constant $s$ that assigns the same $x$ to each $f$, $X(s) = \{x\}$. Hence
the horse-lottery theory of Sections 13.2 and 13.3 cannot be used in
establishing the Part II model unless we assume that consequences other than those
in $X(s)$ can be considered relevant under state $s$.

Suppose in fact that we assume the extreme, that all consequences are
relevant under every state. Then, under B1-B6 of Theorem 13.3, (13.3)
follows. With $f \in F$, $Y \subseteq X$, and $P_f(Y) = P^*(\{s : s(f) \in Y\})$ we then obtain
$E(u, P_f) = E[E(u, \mathbf{P}(s)), P^*]$ when $\mathbf{P}(s)(s(f)) = 1$ for each $s \in S$. Then under




the natural definition of $\prec$ on $F$ in terms of $\prec$ on $\mathcal{H}$,

$$f \prec g \Leftrightarrow E(u, P_f) < E(u, P_g), \quad \text{for all } f, g \in F, \qquad (13.24)$$

which is the Part II model. Although extraneous probabilities are used in
deriving this, note that $P_f, P_g, \ldots$ are defined from $P^*$, which may have
nothing to do with the extraneous probabilities and is itself derived from the
axioms.

It may of course be stretching things too far to assume that consequences
not in $X(s)$ are relevant under $s$. However, it may be possible to say something
about state probabilities even when this assumption is not made.

Suppose, for example, that $F = \{f, g\}$ and $X = \{\text{win}, \text{lose}\}$. Then $S$ has
four elements: $s_1(f) = s_1(g) = \text{win}$; $s_2(f) = \text{win}$, $s_2(g) = \text{lose}$;
$s_3(f) = \text{lose}$, $s_3(g) = \text{win}$; $s_4(f) = s_4(g) = \text{lose}$. Let
$\mathcal{H} = \mathcal{P}(s_1) \times \mathcal{P}(s_2) \times \mathcal{P}(s_3) \times \mathcal{P}(s_4)$.
The conditions of Theorem 13.1 then give
$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_i E(u_i, P(s_i)) < \sum_i E(u_i, Q(s_i))$.
Since $X(s_1) = \{\text{win}\}$ and $X(s_4) = \{\text{lose}\}$, the first and fourth
terms drop out of this and we are left with

$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow E(u_2, P(s_2)) + E(u_3, P(s_3)) < E(u_2, Q(s_2)) + E(u_3, Q(s_3))$. (13.25)

According to our definition $s_1$ and $s_4$ are null, but this is only because $X(s_1)$
and $X(s_4)$ each contain a single consequence: the decision maker might
consider $s_1$ to be the most probable state. In such a case we would regard
$P^*(s_1)$ and $P^*(s_4)$ as indeterminate within the structure of our axioms. This
indeterminacy actually causes no difficulty since the $i = 1$ and $i = 4$ terms
do not appear in (13.25).

With regard to $s_2$ and $s_3$, $\mathcal{P}(s_2) = \mathcal{P}(s_3)$ since $X(s_2) = X(s_3) = \{\text{win}, \text{lose}\}$.
If condition A5 of Theorem 13.2 is used it follows from (13.25) that (assuming
some strict preference) there are $\lambda_2 \ge 0$ and $\lambda_3 \ge 0$ with $\lambda_2 + \lambda_3 > 0$ and
there is a real-valued function $u$ on $X$ such that
$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \lambda_2 E(u, P(s_2)) + \lambda_3 E(u, P(s_3)) < \lambda_2 E(u, Q(s_2)) + \lambda_3 E(u, Q(s_3))$.
Here we would interpret $\lambda_2$ and $\lambda_3$ as proportional to $P^*(s_2)$ and $P^*(s_3)$
respectively so that, if $\lambda_3 > 0$, $\lambda_2/\lambda_3 = P^*(s_2)/P^*(s_3)$. If
$u(\text{win}) > u(\text{lose})$ it is easily seen from this that
$f \prec g \Leftrightarrow P^*(s_2) < P^*(s_3)$.
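The last step of the win/lose example can be checked numerically. The sketch below is only an illustration under stated assumptions: the utilities u(win) = 1 and u(lose) = 0 and the integer grid of weights are hypothetical choices, and the helper names are not from the text. It confirms that, on the two non-null states, f is less preferred than g exactly when the weight on s2 is smaller than the weight on s3.

```python
from fractions import Fraction

# Acts f and g restricted to the states s2 and s3 that survive in (13.25).
f = {"s2": "win", "s3": "lose"}
g = {"s2": "lose", "s3": "win"}

# Hypothetical utilities; any choice with u(win) > u(lose) behaves the same way.
u = {"win": Fraction(1), "lose": Fraction(0)}


def weighted_value(act, weights):
    """Sum over s2, s3 of lambda_i * u(consequence of the act in state s_i)."""
    return sum(weights[s] * u[act[s]] for s in weights)


def f_less_than_g(lam2, lam3):
    """True when the weighted value of f is strictly below that of g."""
    weights = {"s2": Fraction(lam2), "s3": Fraction(lam3)}
    return weighted_value(f, weights) < weighted_value(g, weights)


# Grid of nonnegative weights with lambda_2 + lambda_3 > 0.
grid = [(l2, l3) for l2 in range(4) for l3 in range(4) if l2 + l3 > 0]
agrees_with_ratio_rule = all(f_less_than_g(l2, l3) == (l2 < l3) for l2, l3 in grid)
```

Because the comparison reduces to the sign of (λ2 − λ3)(u(win) − u(lose)), only the ordering of the two weights matters, which is why the ratio λ2/λ3 can be read as a ratio of state probabilities.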

13.5  SUMMARY 

The  usual  states  expected-utility  decision  model  can  be  derived  from  axioms 
that  involve  extraneous  probabilities  when  there  is  sufficient  overlap  among 
consequences  considered  relevant  under  different  states.  When  the  number  of 
states  is  finite,  the  assumption  that  there  are  two  nonindifferent  consequences 
that  are  relevant  under  every  state  is  sufficient.  For  the  more  general  case,  in 
which the size of S is arbitrary, it was assumed that all consequences are
relevant under every state. Even when there may be no overlap among the
consequences  under  different  states,  the  expected-utility  axioms  of  Chapter 
8  when  applied  to  horse  lotteries  with  S  finite  lead  to  an  additive-utility 
representation  that  is  similar  to  additive  forms  of  Section  11.1  and  (12.11). 

Although  the  horse-lottery  approach  presumes  a  continuum  of  extraneous 
probabilities,  this  appears  to  be  offset  by  its  general  applicability  since  it 
places  almost  no  restrictions  on  the  sizes  of  S  and  X.  Moreover,  it  places  no 
unusual  restrictions  on  the  utility  function  on  X  or  on  the  probability 
measure  P*  on  S. 


INDEX  TO  EXERCISES 

1. Insufficiently connected $X(s)$. 2. $P^* = 1$ for a finite subset. 3. Intersections of
partitions. 4. S4. 5-6. Zero-one measures and S4. 7. Additivity when $X = \prod X_i$.
8-15. $P(s) \preceq Q(s)$ for all $s \in S \Rightarrow \mathbf{P} \preceq \mathbf{Q}$. 16-19. Even-chance theory.


Exercises 

1. Let $S = \{s_1, s_2, s_3\}$, $X(s_1) = \{x, y, z, w\}$, $X(s_2) = \{x, y, r, t\}$, $X(s_3) = \{z, w,
r, t\}$. Let the hypotheses of Theorem 13.2 with the exception of A4 hold, and let the
following values of $u_i$ ($i = 1, 2, 3$) satisfy (13.2):


x  y  z  w  r  t 


a. Verify that, for each $i, j$ there is a unique $r_{ij}$ such that $u_i(p) = r_{ij} u_j(p)$ for all
$p \in X(s_i) \cap X(s_j)$. Is $r_{23} = r_{13}/r_{12}$?

b. Show that it is impossible to define $u$ on $X = \bigcup X(s_i)$ and $P^*$ on $S$ so as to
satisfy (13.3).

2. Prove that if $P^*(A) = 1$ for some finite $A \subseteq S$ then B1-B5 imply (13.3) in
the structural context of Section 13.3.

3. Let $\mathcal{D}$ be a set of partitions of $S$. Show that
$\{\bigcap_{D \in \mathcal{D}} f(D) : f(D) \in D$ for each $D \in \mathcal{D}$, $\bigcap_{D \in \mathcal{D}} f(D) \ne \emptyset\}$
is a partition of $S$.

4. In connection with S4 and its proof, suppose that $\mathcal{A}^2, \mathcal{A}^3, \ldots$ is a sequence
of partitions of $S$ such that (1) $\mathcal{A}^n$ contains exactly $n$ events, each with positive
probability, and (2) $A \in \mathcal{A}^{n+1} \Rightarrow A \subseteq B$ for some $B \in \mathcal{A}^n$. Show that it may be
impossible to select one event from each $\mathcal{A}^n$ so that the selected events are mutually
disjoint.


5. Let $P^*$ on $S$ be defined in such a way that there is a set $\mathcal{A}$ of subsets of $S$ such
that $P^*(A) = 1$ if $A \in \mathcal{A}$, and $P^*(A) = 0$ if $A \notin \mathcal{A}$. Prove that if
$\bigcap_{\mathcal{A}} A \ne \emptyset$ then this intersection contains exactly one $s$ and, for this $s$,
$P^*(s) = 1$.

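6. Let $S$ be infinite and let $\mathcal{G}$ be the set of all sets $\mathcal{A}$ of subsets of $S$ that have the
following four properties:

1. $\emptyset \in \mathcal{A}$ and $\{s\} \in \mathcal{A}$ for all $s \in S$;

2. $A \in \mathcal{A} \Rightarrow A^c \notin \mathcal{A}$;

3. $A, B \in \mathcal{A} \Rightarrow A \cup B \in \mathcal{A}$;

4. $(A \in \mathcal{A}, B \subseteq A) \Rightarrow B \in \mathcal{A}$.

a. Is the set of all finite subsets of $S$ in $\mathcal{G}$?

b. Use (4) to show that $(\mathcal{A} \in \mathcal{G}, A \cup B \in \mathcal{A}) \Rightarrow A, B \in \mathcal{A}$.

c. Prove that $(\mathcal{A} \in \mathcal{G}, A \in \mathcal{A}, B \notin \mathcal{A}) \Rightarrow A \cup B \notin \mathcal{A}$.

d. Use Zorn's Lemma to prove that there is an $\mathcal{A} \in \mathcal{G}$ that is maximal with
respect to (1) through (4). Let $\mathcal{A}^*$ be maximal (if $\mathcal{A}^* \subset \mathcal{A}'$ then $\mathcal{A}' \notin \mathcal{G}$) and
let $\mathcal{B}^*$ be the set of all subsets of $S$ that are not in $\mathcal{A}^*$.

e. Prove that $A, B \in \mathcal{B}^* \Rightarrow A \cap B \ne \emptyset$. For this suppose that $A, B \in \mathcal{B}^*$ and
$A \cap B = \emptyset$. Then show that $\mathcal{A}' = \mathcal{A}^* \cup \{C \cup D : C \subseteq A, D \in \mathcal{A}^*\}$ is in $\mathcal{G}$,
contradicting the maximality of $\mathcal{A}^*$.

f. Let $P^*(A) = 0$ if $A \in \mathcal{A}^*$ and $P^*(A) = 1$ if $A \in \mathcal{B}^*$. Show that $P^*$ is a
probability measure on $S$. Note that $\bigcap_{\mathcal{B}^*} A = \emptyset$ and compare with the preceding
exercise.

g. Explain why the failure of the hypotheses of S4 does not imply that $P^*$ on $S$
is a simple probability measure.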

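7. Suppose the hypotheses of Theorem 13.3 hold and, in addition, $X = \prod_{i=1}^n X_i$
and, with $P_j$ the marginal of $P \in \mathcal{P}$ on $X_j$, ($\mathbf{P} = P$ on $S$, $\mathbf{Q} = Q$ on $S$, $P_j = Q_j$
for $j = 1, \ldots, n$) $\Rightarrow \mathbf{P} \sim \mathbf{Q}$. Show that there are real-valued functions $u_1, \ldots, u_n$
on $X_1, \ldots, X_n$ respectively such that

$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow \sum_{i=1}^n E[E(u_i, P(s)_i), P^*] < \sum_{i=1}^n E[E(u_i, Q(s)_i), P^*]$, for all $\mathbf{P}, \mathbf{Q} \in \mathcal{H}$,

where $P(s)_i$ is the marginal of $P(s)$ on $X_i$.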

Note: Exercises 8-15 are set in the context of Section 13.3. Axiom B7 is:
$P(s) \preceq Q(s)$ for all $s \in S \Rightarrow \mathbf{P} \preceq \mathbf{Q}$.

8. Prove that (B1, B7, $A^c$ is null, $P(s) \preceq Q(s)$ for all $s \in A$) $\Rightarrow \mathbf{P} \preceq \mathbf{Q}$.

9. Prove that (B1, B7) $\Rightarrow$ if $\mathbf{P} = P$ and $\mathbf{Q} = Q$ on $A$, $\mathbf{P} = \mathbf{Q}$ on $A^c$, and $P \preceq Q$,
then $\mathbf{P} \preceq \mathbf{Q}$. (This is one half of B5.)

10. By a straightforward partition proof show that if $X$ has a least preferred and
a most preferred consequence then (B1-B5, B7) $\Rightarrow$ (13.3).

11. Show that (B1-B5, B7, there is a denumerable partition $\mathcal{A}$ of $S$ for which
$P^*(A) > 0$ for every $A \in \mathcal{A}$) $\Rightarrow u$ on $X$ is bounded. (Use S1.)

12. (Continuation.) Prove S2 when B6 in its hypotheses is replaced by B7. To do
this you need only verify (13.12) when $P^*(A) = 1$ and $c$ and $d$ are finite. This is the
critical point for B6 and therefore for B7.


13. (Continuation.) Use Exercises 11 and 12 to argue that the conclusions of
Theorem 13.3 are valid when B6 in its hypotheses is replaced by B7.

14. Let $S = \{1, 2, 3, \ldots\}$, $X = [0, 1)$, $u(x) = x$, and let $P^*$ be a probability
measure on $S$ that has $P^*(s) = 0$ for $s = 1, 2, \ldots$. Suppose
$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow v(\mathbf{P}) < v(\mathbf{Q})$ where

$v(\mathbf{P}) = E[E(u, P(s)), P^*] + \inf \{P^*\{P(s)\{x \ge 1 - \varepsilon\} > 0\} : \varepsilon > 0\}$,

with $P(s)\{x \ge 1 - \varepsilon\}$ the probability assigned by the simple measure $P(s)$ to the
subset $\{x : x \ge 1 - \varepsilon, x \in X\}$ of $X$. Show that B1-B5 hold and that B6 and B7 do
not hold.

15. (Continuation.) Let $S$, $X$, $u$, and $P^*$ be as given in Exercise 14 with
$P^*\{1, 3, 5, \ldots\} = P^*\{2, 4, 6, \ldots\} = 1/2$, and let
$\mathbf{P} \prec \mathbf{Q} \Leftrightarrow v(\mathbf{P}) < v(\mathbf{Q})$ where

$v(\mathbf{P}) = E[E(u, P(s)), P^*] + \inf \{P^*\{E(u, P(s)) \ge 1 - \varepsilon\} : \varepsilon > 0\}$.

a. Prove that $(0 < \alpha < 1, \mathbf{P}, \mathbf{R} \in \mathcal{H}) \Rightarrow$
$\inf \{P^*\{E(u, \alpha P(s) + (1 - \alpha)R(s)) \ge 1 - \varepsilon\} : \varepsilon > 0\}$
$= \inf \{P^*(\{E(u, P(s)) \ge 1 - \varepsilon\} \cap \{E(u, R(s)) \ge 1 - \varepsilon\}) : \varepsilon > 0\}$.

Note: $\{E(u, P(s)) \ge 1 - \varepsilon\} = \{s : E(u, P(s)) \ge 1 - \varepsilon\}$.

b. Show that B1, B4, B5, and B7 hold.

c. By specific example, show that B2 does not hold.

d. By specific example, show that B3 does not hold.

e. By specific example, show that B6 does not hold.

Note: In the remaining exercises, $F$ is the set of all functions on $S$ to $X$, $(f, g) \in F^2$
is interpreted as an even-chance alternative, $x^*$ is the act in $F$ that assigns $x \in X$ to
every $s \in S$, and $A \subseteq S$ is null $\Leftrightarrow (f, g) \sim (f', g')$ whenever
$(f(s), g(s)) = (f'(s), g'(s))$ for all $s \in A^c$. Let D1 through D7 be the following axioms:

D1. $\prec$ on $F \times F = F^2$ is a weak order.

di.  [(/,/)  <  (/',**),  {f,g)  <  (f*g')]=>(g.f)  <  (s*,rx 

D3. $(X, \mathcal{T})$ is a connected and separable topological space.

D4. $\{(f, g) : (f, g) \in F^2, (f, g) \prec (f', g')\} \in \mathcal{T}^{2n}$ and
$\{(f, g) : (f, g) \in F^2, (f', g') \prec (f, g)\} \in \mathcal{T}^{2n}$ for each $(f', g') \in F^2$.

D5. $(x^*, x^*) \prec (y^*, y^*)$ for some $x, y \in X$.

D6. ($A$ is not null; $f = x$, $g = y$, $f' = z$, $g' = w$ on $A$; $\{f(s), g(s)\} = \{f'(s), g'(s)\}$
for each $s \in A^c$) $\Rightarrow [(x^*, y^*) \prec (z^*, w^*) \Leftrightarrow (f, g) \prec (f', g')]$.

D7. $[(f(s)^*, g(s)^*) \preceq (f', g')$ for all $s \in S] \Rightarrow (f, g) \preceq (f', g')$;
$[(f', g') \preceq (f(s)^*, g(s)^*)$ for all $s \in S] \Rightarrow (f', g') \preceq (f, g)$.

16. Prove that if $S$ is finite and D1-D6 hold then there is a real-valued function
$u$ on $X$ and a probability measure $P^*$ on $S$ such that

$(f, g) \prec (f', g') \Leftrightarrow E[u(f(s)), P^*] + E[u(g(s)), P^*] < E[u(f'(s)), P^*] + E[u(g'(s)), P^*]$

for all $(f, g), (f', g') \in F^2$, with $P^*$ unique and $u$ unique up to a positive linear
transformation when this holds.

17. Let $S$ be of any size and assume that there is a real-valued function $v$ on $F$
that satisfies

$(f, g) \prec (f', g') \Leftrightarrow v(f) + v(g) < v(f') + v(g')$, for all $(f, g), (f', g') \in F^2$,

and is unique up to a positive linear transformation when it satisfies this. Assume
also that the restriction of $v$ to $\{x^* : x \in X\}$ is unique up to a positive linear
transformation when it satisfies

$(x^*, y^*) \prec (z^*, w^*) \Leftrightarrow v(x^*) + v(y^*) < v(z^*) + v(w^*)$, for all $x, y, z, w \in X$,

and that D5 and D6 hold.

Let $F_0 = \{f : f \in F$ and $\{f(s) : s \in S\}$ is finite$\}$. Prove that, with $u(x) = v(x^*)$, there
is a unique probability measure $P^*$ on $S$ such that

$v(f) = E[u(f(s)), P^*]$ (13.26)

for all $f \in F_0$, with $A$ null $\Leftrightarrow P^*(A) = 0$. (Compare with S1.)

18. (Continuation.) Along with the assumptions of the preceding exercise
assume that D7 holds and that for any $x, y \in X$ there is a $z \in X$ such that

$(x^*, y^*) \sim (z^*, z^*)$. (13.27)

Call $f \in F$ bounded $\Leftrightarrow P^*\{a \le u(f(s)) \le b\} = 1$ for some numbers $a$ and $b$. Use the
following steps to prove that (13.26) holds when $f$ is bounded.

a. Show that $P^*\{a \le u(f(s)) \le b\} = 1 \Rightarrow a \le v(f) \le b$. [Let $A = \{s : a \le
u(f(s)) \le b\}$, let $d = \sup \{u(f(s)) : s \in A\}$, suppose $d < v(f)$ and use D7 to
obtain a contradiction.]

b. With $f$ bounded on $A$ and $P^*(A) = 1$, for convenience assume
$\inf \{u(f(s)) : s \in A\} = 0$ and $\sup \{u(f(s)) : s \in A\} = 1$, and assume (with no loss in
generality since $A^c$ is null) that $0 \le u(f(s)) \le 1$ on $A^c$. Given a positive integer $n$ let
$A_1 = \{s : 0 \le u(f(s)) \le 1/n\}$, $A_i = \{s : (i - 1)/n < u(f(s)) \le i/n\}$ for $i = 2, \ldots, n$.
By (13.27) there are $x_i \in X$ for which $(i - 1)/n \le u(x_i) \le i/n$ for $i = 1, \ldots, n$.
Define $f_i, g_i \in F$ by

$f_i = f$ on $A_i$, $f_i = x_i$ on $A_i^c$ ($i = 1, \ldots, n$),

$g_i = x_{i+1}$ on $\bigcup_{j=1}^{i} A_j$, $g_i = x_i$ on $\bigcup_{j=i+1}^{n} A_j$ ($i = 1, \ldots, n - 1$).

Use the fact that $\{f'(s), g'(s)\} = \{f''(s), g''(s)\}$ for all $s \in S$ implies
$(f', g') \sim (f'', g'')$ along with the first $\Leftrightarrow$ expression in Exercise 17 to prove that

$v(f) + \sum_{i=1}^{n-1} v(g_i) = \sum_{i=1}^{n} v(f_i)$. (13.28)

c. Under the conditions in (b), it follows from (13.27) that for any $\varepsilon > 0$ and
any $i \in \{1, \ldots, n\}$ there are $x \in X$ with $|u(x) - i/n| < \varepsilon$. Use this, (13.28),
(13.26) for $F_0$, and the bounds on the $v(f_i)$ implied by step (a) to show that

$\sum_{i=1}^{n} P^*(A_i)(i - 1)/n - 1/n \le v(f) \le \sum_{i=1}^{n} P^*(A_i)\, i/n + 1/n$. Then argue that
$v(f) = E[u(f(s)), P^*]$.
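The bookkeeping behind (13.28) can be sanity-checked mechanically. The sketch below assumes the reading of step (b) in which f_i equals f on A_i and x_i elsewhere, while g_i equals x_{i+1} on A_1 ∪ ... ∪ A_i and x_i on A_{i+1} ∪ ... ∪ A_n; this reading and the function name are assumptions made for illustration. The check confirms that in every state the collection {f, g_1, ..., g_{n-1}} yields the same multiset of consequences as {f_1, ..., f_n}. It does not replace the preference argument the exercise asks for.

```python
from collections import Counter


def check_multiset_identity(n):
    """Compare, for s in each A_k, the consequences of {f, g_i} with those of {f_i}."""
    for k in range(1, n + 1):          # the state s lies in A_k
        left = Counter(["f(s)"])       # contribution of f itself
        for i in range(1, n):          # g_i = x_{i+1} on A_1..A_i, x_i on A_{i+1}..A_n
            left[f"x{i + 1}" if k <= i else f"x{i}"] += 1
        right = Counter()
        for i in range(1, n + 1):      # f_i = f on A_i, x_i on the complement of A_i
            right["f(s)" if i == k else f"x{i}"] += 1
        if left != right:
            return False
    return True


identity_holds = all(check_multiset_identity(n) for n in range(1, 9))
```

In both collections a state in A_k receives f(s) once and each x_i with i ≠ k once, which is the pointwise fact that the additive representation of Exercise 17 turns into (13.28).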

19. (Continuation.) Under the assumptions of the preceding exercise prove (a)
$u$ on $X$ is bounded if there is a denumerable partition of $S$ that has $P^*(A) > 0$ for
every $A$ in the partition, (b) every $f \in F$ is bounded.


Chapter  14 


SAVAGE’S  EXPECTED- 
UTILITY  THEORY 


The  most  brilliant  axiomatic  theory  of  utility  ever  developed  is,  in  my 
opinion,  the  expected-utility  theory  of  Savage  (1954).  It  is  an  eminently 
suitable  theory  with  which  to  conclude  this  book. 

As has been true of significant developments throughout the history of
mathematics, Savage's theory was not developed in a vacuum. He acknowledges
and draws on the prior ideas of Ramsey (1931), de Finetti (1937), and
von Neumann and Morgenstern (1947). His general approach is not unlike
that presented by Ramsey in outline form. Unlike Ramsey, who proposed to
first derive utility on the basis of an "ethically neutral proposition" or
even-chance event and then to derive probabilities on the basis of utilities,
Savage reverses this procedure. In his axiomatization of

$f \prec g \Leftrightarrow E[u(f(s)), P^*] < E[u(g(s)), P^*]$, for all $f, g \in F$,

which is based solely on the binary relation $\prec$ on $F$, Savage first obtains the
probability measure $P^*$ on the set of subsets of $S$. This development owes
much to de Finetti's work in probability theory. Using $P^*$ Savage then
obtains a structure much like that used by von Neumann and Morgenstern
in their utility theory (Theorem 8.2), and proceeds to specify $u$ on $X$. One
final axiom then leads to the above representation on all of $F$. Savage's
theorem is given in the next section, which also contains an outline of later
sections.

14.1  SAVAGE’S  EXPECTED-UTILITY  THEOREM 

The  main  purpose  of  this  chapter  is  to  explore  Theorem  14.1,  which  may 
appropriately  be  referred  to  as  Savage’s  expected-utility  theorem.  This 
section  presents  the  theorem  and  discusses  its  conditions  and  conclusions. 
Some  preliminary  definitions  are  required. 



$S$ is the set of states, $X$ is the set of consequences, and $F$ is the set of all
functions on $S$ into $X$. $A, B \subseteq S$; $x, y \in X$; $f, g \in F$. $\prec$ on $F$ is the basic
binary relation with $\sim$ and $\preceq$ defined in the usual way: $f \sim g \Leftrightarrow$ (not $f \prec g$,
not $g \prec f$), and $f \preceq g \Leftrightarrow (f \prec g$ or $f \sim g)$.

$f = g$ on $A \Leftrightarrow f(s) = g(s)$ for all $s \in A$. $f = x$ on $A \Leftrightarrow f(s) = x$ for
all $s \in A$. Partitions of $S$ and complements are defined as in Section 13.1.
$A$ is null $\Leftrightarrow f \sim g$ whenever $f = g$ on $A^c$.

$x \prec y \Leftrightarrow f \prec g$ when $f = x$ and $g = y$ on $S$. $x \prec f \Leftrightarrow g \prec f$ when
$g = x$ on $S$. Similar definitions hold for $x \sim y$, $f \sim y$, $x \preceq f$, and so forth.

Conditional preference is defined as follows: $f \prec g$ given $A \Leftrightarrow f' \prec g'$
whenever $f = f'$ and $g = g'$ on $A$, and $f' = g'$ on $A^c$. $\sim$ given $A$ and $\preceq$ given
$A$ are defined in the usual way. $x \prec g$ given $A$ means that $f \prec g$ given $A$
whenever $f = x$ on $A$.

THEOREM 14.1. Suppose that the following seven conditions hold for all
$f, g, f', g' \in F$, $A, B \subseteq S$, and $x, y, x', y' \in X$:

P1. $\prec$ on $F$ is a weak order;

P2. ($f = f'$ and $g = g'$ on $A$, $f = g$ and $f' = g'$ on $A^c$) $\Rightarrow (f \prec g \Leftrightarrow f' \prec g')$;

P3. ($A$ is not null, $f = x$ and $g = y$ on $A$) $\Rightarrow (f \prec g$ given $A \Leftrightarrow x \prec y)$;

P4. [($x \prec y$, $f = y$ on $A$, $f = x$ on $A^c$, $g = y$ on $B$, $g = x$ on $B^c$) and
($x' \prec y'$, $f' = y'$ on $A$, $f' = x'$ on $A^c$, $g' = y'$ on $B$, $g' = x'$ on $B^c$)] $\Rightarrow$
$(f \prec g \Leftrightarrow f' \prec g')$;

P5. $x \prec y$ for some $x, y \in X$;

P6. ($f \prec g$, $x \in X$) $\Rightarrow$ there is a finite partition of $S$ such that, if $A$ is any
event in the partition, then ($f' = x$ on $A$, $f' = f$ on $A^c$) $\Rightarrow f' \prec g$, and
($g' = x$ on $A$, $g' = g$ on $A^c$) $\Rightarrow f \prec g'$;

P7. ($f \prec g(s)$ given $A$, for all $s \in A$) $\Rightarrow f \preceq g$ given $A$. ($g(s) \prec f$ given $A$,
for all $s \in A$) $\Rightarrow g \preceq f$ given $A$.

Then, with $\prec^*$ defined on the set of all subsets of $S$ by

$A \prec^* B \Leftrightarrow f \prec g$ whenever ($x \prec y$, $f = y$ on $A$,
$f = x$ on $A^c$, $g = y$ on $B$, $g = x$ on $B^c$), (14.1)

there is a unique probability measure $P^*$ on the set of all subsets of $S$ that
satisfies

$A \prec^* B \Leftrightarrow P^*(A) < P^*(B)$, for all $A, B \subseteq S$, (14.2)

and $P^*$ has the property that

$(B \subseteq S, 0 \le \rho \le 1) \Rightarrow P^*(C) = \rho P^*(B)$ for some $C \subseteq B$; (14.3)

and, with $P^*$ as given, there is a real-valued function $u$ on $X$ for which

$f \prec g \Leftrightarrow E[u(f(s)), P^*] < E[u(g(s)), P^*]$, for all $f, g \in F$, (14.4)

and when $u$ satisfies this it is bounded and unique up to a positive linear
transformation.

The final condition, P7, is similar to A4a of Section 10.4 and B6 of Section
13.3. It is an obvious dominance (or sure-thing, or independence) condition
and it is not required in the derivation of $P^*$ that satisfies (14.2), just as B6
was not required in the derivation of $P^*$ for Theorem 13.3. The form of P7
given in the theorem is slightly weaker than (does not assume as much as)
Savage's original form which has $\preceq$ where $\prec$ appears in P7, but the two are
equivalent in the presence of the other conditions, P1-P6.

P2 and P3 explicate Savage's "sure-thing principle." P2 says that preferences
between acts should not depend on those states that have identical
consequences for the two acts. It is closely related to the independence
condition of Chapter 8 and is found reasonable by many persons provided
that the state that obtains does not depend on the act that is actually
implemented. Together, P1 and P2 imply that $\prec$ given $A$ is a weak order on $F$ for
every $A \subseteq S$.
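P2 can be checked exhaustively in a small finite model. The sketch below is only an illustration: the three states, three consequences, the probabilities, and the utilities are hypothetical, and a finite S cannot satisfy P6, so this shows only that preferences generated by expected utility respect P2. For every event A it varies the common part of two acts on the complement of A and confirms that the preference between them does not change.

```python
from fractions import Fraction
from itertools import product

S = ("s1", "s2", "s3")
X = ("x", "y", "z")
# Hypothetical state probabilities and utilities, used only for illustration.
P_star = dict(zip(S, (Fraction(1, 2), Fraction(3, 10), Fraction(1, 5))))
u = dict(zip(X, (Fraction(0), Fraction(1), Fraction(3))))


def expected_utility(act):
    return sum(P_star[s] * u[act[s]] for s in S)


def prefers(f, g):
    """f is strictly less preferred than g."""
    return expected_utility(f) < expected_utility(g)


def assignments(states):
    """All ways of assigning consequences to the given states, as dicts."""
    return [dict(zip(states, xs)) for xs in product(X, repeat=len(states))]


def p2_holds():
    for mask in product((False, True), repeat=len(S)):
        A = [s for s, m in zip(S, mask) if m]
        Ac = [s for s, m in zip(S, mask) if not m]
        for fA, gA in product(assignments(A), repeat=2):
            for common, other in product(assignments(Ac), repeat=2):
                f, g = {**fA, **common}, {**gA, **common}    # f = g on the complement
                f2, g2 = {**fA, **other}, {**gA, **other}    # f' = g' on the complement
                if prefers(f, g) != prefers(f2, g2):
                    return False
    return True


sure_thing_ok = p2_holds()
```

The common part on the complement of A contributes the same amount to both expected utilities, so it cancels from the comparison; this is the cancellation that P2 asserts directly in terms of preference.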

P3, as a companion to P2, says that if $f = x$ and $g = y$ on $A$ and if $A$ is
not null, then $f \prec g$ given $A \Leftrightarrow f' \prec g'$ when $f' = x$ and $g' = y$ on $S$. This
sets up a reasonable correspondence between preferences on consequences
(constant acts) and conditional preferences on events that the decision maker
regards as possible.

$\prec^*$ as defined in (14.1) is a qualitative probability relation on the set of
events. We read $A \prec^* B$ as "$A$ is less probable than $B$." As noted in (14.1),
"is less probable than" is defined in terms of "is less preferred than." The
principal objective of P4 in this connection is to ensure that $\prec^*$ on the events
is a weak order. Suppose you prefer $y$ to $x$ and can either take your chances
on getting $y$ if $A$ obtains or on getting $y$ if $B$ obtains. In either case if the event
you take your chances on does not obtain you will receive the less preferred $x$.
If you select $B$ then it seems reasonable to suppose that you regard $B$ as more
probable than $A$. P4 says that if you prefer to take your chances on getting
$y$ if $B$ obtains then, with two other consequences $y'$ and $x'$ with $y'$ preferred
to $x'$, you would (or ought to) rather take your chances on getting $y'$ if $B$
obtains than on getting $y'$ if $A$ obtains. As in the case of P2, this seems
reasonable as long as the state that obtains does not depend on the
consequences assigned to the states by any particular act.
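The way (14.1) turns preferences between bets into a "less probable than" relation, and the independence from the prizes that P4 demands, can be illustrated in a toy model. Everything numerical below is a hypothetical assumption: a three-state S, a fixed probability on it, and a few prize pairs given directly by their utilities. The sketch checks that comparing the bet on A with the bet on B gives the same verdict for every prize pair and that the verdict agrees with the ordering of the probabilities.

```python
from fractions import Fraction
from itertools import combinations

S = ("s1", "s2", "s3")
# Hypothetical probabilities, used only to generate preferences between bets.
P_star = {"s1": Fraction(1, 2), "s2": Fraction(3, 10), "s3": Fraction(1, 5)}


def events():
    return [frozenset(c) for r in range(len(S) + 1) for c in combinations(S, r)]


def prob(event):
    return sum((P_star[s] for s in event), Fraction(0))


def bet_value(event, u_low, u_high):
    """Expected utility of the act paying the better prize on `event`, the worse elsewhere."""
    return sum(P_star[s] * (u_high if s in event else u_low) for s in S)


def less_probable(A, B, u_low, u_high):
    """The bet on A is strictly less preferred than the bet on B."""
    return bet_value(A, u_low, u_high) < bet_value(B, u_low, u_high)


# Utilities (u(x), u(y)) of a few hypothetical prize pairs with x less preferred than y.
prize_pairs = [
    (Fraction(0), Fraction(1)),
    (Fraction(-5), Fraction(2)),
    (Fraction(1, 3), Fraction(1, 2)),
]

consistent = all(
    less_probable(A, B, lo, hi) == (prob(A) < prob(B))
    for A in events()
    for B in events()
    for lo, hi in prize_pairs
)
```

The value of a bet on A is u(x) + (u(y) − u(x))P*(A), so with u(x) < u(y) the comparison depends on the events alone. In the axiomatic development this independence is not computed but assumed, and that assumption is P4.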

P5 says that indifference does not hold between every pair of constant acts.
It is needed to ensure the uniqueness of $P^*$. If P5 were false then $\prec^*$ would be
reflexive. For further remarks see Section 14.3.

The effect of P6, which is a rather strong assumption, can best be seen
from (14.3) which in the presence of (14.2) follows from P1-P6. Among other
things, (14.3) says that $S$ must be uncountable, that $P^*(s) = 0$ for every
$s \in S$, and that for any positive integer $n$ there is an $n$-event partition of $S$
having $P^*(A) = 1/n$ for each event $A$ in the partition. We shall refer to such
partitions as uniform partitions.
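Property (14.3) and uniform partitions are easiest to picture in a concrete model. The sketch below uses a stand-in that is not Savage's construction: S is taken to be the interval [0, 1), events are finite unions of disjoint half-open intervals listed left to right, and P* is total length. Savage's P* is defined on all subsets and is only required to be finitely additive, so the interval model and the helper names are assumptions made purely for illustration.

```python
from fractions import Fraction


def length(event):
    """Total length of an event given as disjoint half-open intervals (a, b)."""
    return sum((b - a for a, b in event), Fraction(0))


def sub_event(event, rho):
    """Return C inside `event` with length(C) == rho * length(event), filling left to right."""
    target = rho * length(event)
    chosen = []
    for a, b in event:
        if target <= 0:
            break
        take = min(b - a, target)
        chosen.append((a, a + take))
        target -= take
    return chosen


def uniform_partition(n):
    """An n-event partition of [0, 1) in which every event has length 1/n."""
    return [[(Fraction(i, n), Fraction(i + 1, n))] for i in range(n)]


B = [(Fraction(0), Fraction(1, 4)), (Fraction(1, 2), Fraction(1))]   # length 3/4
C = sub_event(B, Fraction(2, 3))                                      # length 1/2
parts = uniform_partition(5)
```

A model with an atom, that is, a state of positive probability, cannot satisfy (14.3): no subset of the atom has half its probability. That is one way to see why P*(s) = 0 is forced for every state.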

In guaranteeing $A \prec^* B \Leftrightarrow P^*(A) < P^*(B)$, P6 has an unmistakable
Archimedean quality. In effect it says that no consequence is "infinitely
desirable" (which would negate $f' \prec g$ if $x$ were so desirable) and that no
consequence is "infinitely undesirable" (which would negate $f \prec g'$ if $x$ were
so undesirable). If $S$ is allowed to be infinite, something like P6 is required
to ensure the existence of real-valued order ($\prec^*$) preserving probabilities.
As Savage points out, weaker versions of P6 are sufficient for (14.2), but may
not yield (14.3) as well. The usefulness of (14.3) will become apparent when
we see how it is used as a point of departure in defining gambles on $X$ that
lead to the definition of the utility function $u$ on $X$.

P1 through P6 are sufficient to obtain (14.4) for all acts in $F$ that assign no
more than a finite number of consequences to all the states in some event $A$
for which $P^*(A) = 1$. P7 is then used (as was B6 in the preceding chapter) to
verify that (14.4) holds for all acts, and it ensures that $u$ on $X$ is bounded.
When he wrote The Foundations of Statistics, Savage had the impression that
P1-P7 do not imply that $u$ is bounded. Some years later, when we were
working on the theory that appears in Chapter 10 of Part II, we discovered
that this impression was false. Because of the false impression, Savage did
not state (14.4) for all acts but, in light of the boundedness of $u$, he did in
fact prove (14.4) as it is presented here. In other words, he proved (14.4) for
all bounded acts. Since $u$ is bounded, all acts are bounded. The proof of
boundedness given later is essentially his.

In  proving  Theorem  14.1  I  shall  follow  the  pattern  used  by  Savage.  Here  is 
a  sectional  outline. 

Section 14.2 shows that (14.2) and (14.3) follow from five conditions
(F1-F5) for $\prec^*$ on the set of events.

Section 14.3 under definition (14.1) shows that P1-P6 $\Rightarrow$ F1-F5.

Section 14.4 establishes, from P1-P6, the three preference axioms of
Theorem 8.2, which shows that (14.4) holds for acts confined with probability
one to a finite subset of consequences.

Section 14.5 proves that $u$ on $X$ is bounded. This uses P7.

Section 14.6 uses P7 to verify (14.4) for all acts.

The  proofs  that  follow  are  essentially  Savage’s.  I  have  added  some  details 
to  them  in  places  where  I  felt  that  this  would  aid  some  readers. 


14.2  AXIOMS  FOR  PROBABILITY 
In  this  section  we  shall  prove  the  following  theorem. 

THEOREM 14.2. Suppose that $\prec^*$ on the set of all subsets of $S$ satisfies the
following conditions for all $A, B, C \subseteq S$:

F1. not $A \prec^* \emptyset$,

F2. $\emptyset \prec^* S$,

F3. $\prec^*$ is a weak order,

F4. $A \cap C = B \cap C = \emptyset \Rightarrow (A \prec^* B \Leftrightarrow A \cup C \prec^* B \cup C)$,

F5. $A \prec^* B \Rightarrow$ there is a finite partition $\{C_1, \ldots, C_m\}$ of $S$ for which
$A \cup C_i \prec^* B$ for $i = 1, \ldots, m$.

Then there is one and only one probability measure $P^*$ on the set of all
subsets of $S$ that satisfies (14.2), and (14.3) holds for this measure.

F1-F4, which define $\prec^*$ as a qualitative probability, are necessary for (14.2),
but collectively they are not sufficient. F1-F5 as noted are sufficient but F5 is
not necessary for (14.2) although it does follow from (14.2) and (14.3).
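The remark that F5 is not necessary for (14.2) can be seen in a small finite model. In the sketch below the relation is generated by a hypothetical probability on three states, so (14.2) holds by construction; the numbers and helper names are assumptions for illustration. F1 through F4 are verified by enumeration, and F5 is shown to fail because every partition of a finite S with atoms contains a block that is too probable.

```python
from fractions import Fraction
from itertools import combinations, product

S = ("s1", "s2", "s3")
# Hypothetical probabilities; the relation below satisfies (14.2) by construction.
P_star = {"s1": Fraction(1, 2), "s2": Fraction(3, 10), "s3": Fraction(1, 5)}
EVENTS = [frozenset(c) for r in range(len(S) + 1) for c in combinations(S, r)]
EMPTY, WHOLE = frozenset(), frozenset(S)


def prob(event):
    return sum((P_star[s] for s in event), Fraction(0))


def lt(A, B):
    """A is strictly less probable than B."""
    return prob(A) < prob(B)


def partitions(items):
    """All partitions of a list of states into nonempty blocks (frozensets)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in partitions(rest):
        for i in range(len(part)):
            yield part[:i] + [part[i] | {first}] + part[i + 1:]
        yield part + [frozenset({first})]


F1 = all(not lt(A, EMPTY) for A in EVENTS)
F2 = lt(EMPTY, WHOLE)
# Weak order: asymmetric and negatively transitive.
F3 = all(not (lt(A, B) and lt(B, A)) for A, B in product(EVENTS, repeat=2)) and all(
    (not lt(A, C)) or lt(A, B) or lt(B, C) for A, B, C in product(EVENTS, repeat=3)
)
F4 = all(
    lt(A, B) == lt(A | C, B | C)
    for A, B, C in product(EVENTS, repeat=3)
    if not (A & C) and not (B & C)
)
F5 = all(
    any(all(lt(A | Ci, B) for Ci in part) for part in partitions(list(S)))
    for A, B in product(EVENTS, repeat=2)
    if lt(A, B)
)
```

With A empty and B = {s3}, any partition has a block containing s1, whose probability 1/2 already exceeds P*(B) = 1/5, so F5 fails even though the relation is represented by a probability measure.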

As usual we define $A \sim^* B \Leftrightarrow$ (not $A \prec^* B$, not $B \prec^* A$), and $A \preceq^* B \Leftrightarrow$
($A \prec^* B$ or $A \sim^* B$). Throughout this section and the rest of this chapter,
I shall use $A/B$ ("$A$ but not $B$") to denote the complement of $B$ relative to $A$:

$A/B = A \cap B^c$. (14.5)

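In approaching Theorem 14.2 we begin with a series of consequences of
F1-F5. C1 through C4 presuppose only F1-F4. The rest presuppose all of F1
through F5.

C1. $B \subseteq C \Rightarrow \emptyset \preceq^* B \preceq^* C \preceq^* S$.

C2($\sim^*$). $(A \sim^* B, B \cap C = \emptyset) \Rightarrow A \cup C \preceq^* B \cup C$.

C2($\prec^*$). $(A \prec^* B, B \cap C = \emptyset) \Rightarrow A \cup C \prec^* B \cup C$.

C3($\sim^*$). $(A \sim^* B, C \sim^* D, B \cap D = \emptyset) \Rightarrow A \cup C \preceq^* B \cup D$.

C3($\prec^*$). $(A \preceq^* B, C \prec^* D, B \cap D = \emptyset) \Rightarrow A \cup C \prec^* B \cup D$.

C4. $(A \sim^* B, C \sim^* D, A \cap C = B \cap D = \emptyset) \Rightarrow A \cup C \sim^* B \cup D$.

C5. $\emptyset \prec^* A \Rightarrow A$ can be partitioned into two events $B$ and $C$ for which
($\emptyset \prec^* B$, $\emptyset \prec^* C$).

C6. ($A$, $B$, and $C$ are pairwise disjoint, $A \preceq^* B$, $B \prec^* A \cup C$) $\Rightarrow$ there is
a $D \subseteq C$ for which $\emptyset \prec^* D$ and $B \cup D \prec^* A \cup (C/D)$.

C7. ($\emptyset \prec^* A$, $\emptyset \prec^* B$, $A \cap B = \emptyset$) $\Rightarrow B$ can be partitioned into $C$ and
$D$ for which $C \preceq^* D \preceq^* A \cup C$.

C8. $\emptyset \prec^* A \Rightarrow A$ can be partitioned into $B$ and $C$ with $B \sim^* C$.

C9. $\emptyset \prec^* A \Rightarrow$ for any positive integer $n$ there is a $2^n$ part partition of $A$
such that $\sim^*$ holds between each two events in the partition.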
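Since C1 through C4 presuppose only F1-F4, they can be checked by brute force in any finite model in which those four conditions hold. The sketch below does this for a relation generated by a hypothetical probability on three states. It reads C3(≺*) with the hypothesis that A is at most as probable as B, which is an assumption about a symbol that is hard to make out in the source. C5 through C9 depend on F5 and cannot hold in such a model.

```python
from fractions import Fraction
from itertools import combinations, product

S = ("s1", "s2", "s3")
# Hypothetical probabilities; note P*({s1}) = P*({s2, s3}), so some ties occur.
P_star = {"s1": Fraction(1, 2), "s2": Fraction(3, 10), "s3": Fraction(1, 5)}
EVENTS = [frozenset(c) for r in range(len(S) + 1) for c in combinations(S, r)]
EMPTY, WHOLE = frozenset(), frozenset(S)


def prob(event):
    return sum((P_star[s] for s in event), Fraction(0))


def lt(A, B):   # strictly less probable
    return prob(A) < prob(B)


def le(A, B):   # not more probable
    return not lt(B, A)


def eq(A, B):   # equally probable
    return not lt(A, B) and not lt(B, A)


c1 = all(
    le(EMPTY, B) and le(B, C) and le(C, WHOLE)
    for B, C in product(EVENTS, repeat=2)
    if B <= C
)
c2_sim = all(
    le(A | C, B | C)
    for A, B, C in product(EVENTS, repeat=3)
    if eq(A, B) and not (B & C)
)
c2_strict = all(
    lt(A | C, B | C)
    for A, B, C in product(EVENTS, repeat=3)
    if lt(A, B) and not (B & C)
)
c3_strict = all(
    lt(A | C, B | D)
    for A, B, C, D in product(EVENTS, repeat=4)
    if le(A, B) and lt(C, D) and not (B & D)
)
c4 = all(
    eq(A | C, B | D)
    for A, B, C, D in product(EVENTS, repeat=4)
    if eq(A, B) and eq(C, D) and not (A & C) and not (B & D)
)
```

In C2 and C3 only the sets on the right-hand side are required to be disjoint, because overlap on the left can only make the left-hand union less probable.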

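Proofs of C1 through C9

C1. The proof is easy and is left to the reader.

C2($\sim^*$). Assume $(A \sim^* B, B \cap C = \emptyset)$. Since $A = (A/C) \cup (A \cap C)$
and $A \cap (C/A) = \emptyset$, F4 $\Rightarrow (A/C) \cup (A \cap C) \cup (C/A) \sim^* B \cup (C/A)$, or
$A \cup C \sim^* B \cup (C/A)$. By C1, $B \cup (C/A) \preceq^* B \cup C$. Hence, by F3,
$A \cup C \preceq^* B \cup C$.

C2($\prec^*$). Replace $\sim^*$ by $\prec^*$ in preceding proof.

C3($\sim^*$). Assume $(A \sim^* B, C \sim^* D, B \cap D = \emptyset)$. Since $(C/B) \cap B = \emptyset$,
C2($\sim^*$) $\Rightarrow A \cup (C/B) \preceq^* B \cup (C/B) = B \cup C$. Also, since $(B/C) \cap D = \emptyset$,
C2($\sim^*$) and $C \sim^* D$ imply $B \cup C = C \cup (B/C) \preceq^* D \cup (B/C)$.
By F3, $A \cup (C/B) \preceq^* D \cup (B/C)$. This, C2, and $(B \cap C) \cap (D \cup (B/C)) = \emptyset$
then imply that $A \cup (C/B) \cup (B \cap C) \preceq^* D \cup (B/C) \cup (B \cap C)$, or
$A \cup C \preceq^* B \cup D$.

C3($\prec^*$). Replace $C \sim^* D$ by $C \prec^* D$ in preceding proof. Use C2($\prec^*$).

C4. Assume $(A \sim^* B, C \sim^* D, A \cap C = B \cap D = \emptyset)$. By C3($\sim^*$),
$A \cup C \preceq^* B \cup D$ and $B \cup D \preceq^* A \cup C$. Hence $A \cup C \sim^* B \cup D$.

C5. Assume $\emptyset \prec^* A$. F5 $\Rightarrow$ there is a partition $\{D_1, \ldots, D_n\}$ of $S$ for
which $D_i \prec^* A$ for each $i$. C1 $\Rightarrow D_i \cap A \preceq^* (D_i \cap A) \cup (D_i/A) = D_i$.
Hence $D_i \cap A \prec^* A$ for all $i$. If $D_i \cap A \sim^* \emptyset$ for each $i$ then, by C4,
$\bigcup_i (D_i \cap A) \sim^* \emptyset$, or $A \sim^* \emptyset$, a contradiction. If $\emptyset \prec^* D_i \cap A$ for
only one $i$, say $i = 1$, then $A \sim^* D_1 \cap A$ which contradicts $D_1 \cap A \prec^* A$.
Hence $\emptyset \prec^* D_i \cap A$ for at least two $i$.

C6. Assume $(A \cap B = A \cap C = B \cap C = \emptyset, A \preceq^* B, B \prec^* A \cup C)$.
(F3, F4) $\Rightarrow \emptyset \prec^* C$. Since $B \prec^* A \cup C$ and $\emptyset \prec^* C$, it follows from F5
that there is a $D_1 \subseteq C$ for which $\emptyset \prec^* D_1$ and $B \cup D_1 \prec^* A \cup C$. By
C5 and F3, $D_1$ can be partitioned into $D$ and $D'$ with $\emptyset \prec^* D \preceq^* D'$, so
that $B \cup D \cup D' \prec^* A \cup (C/D) \cup D$. F4 $\Rightarrow B \cup D' \prec^* A \cup (C/D)$. (F4,
$D \preceq^* D'$) $\Rightarrow B \cup D \preceq^* B \cup D'$. Hence $B \cup D \prec^* A \cup (C/D)$.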

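C7. Assume $(\emptyset \prec^* A, \emptyset \prec^* B, A \cap B = \emptyset)$. If $B \preceq^* A$ the conclusion
follows easily from C5. Assume that $A \prec^* B$. F5 $\Rightarrow$ there is a partition
$\{G_1, \ldots, G_n\}$ of $B$ such that $G_i \prec^* A$ for each $i$. For definiteness assume
that $G_1 \preceq^* \cdots \preceq^* G_n$. Let $m$ be such that
$\bigcup_1^m G_i \preceq^* \bigcup_{m+1}^n G_i \preceq^* \bigcup_1^{m+1} G_i$.
Let $C = \bigcup_1^m G_i$ and $D = \bigcup_{m+1}^n G_i$. Then $C \preceq^* D \preceq^* C \cup G_{m+1}$ which,
since $G_{m+1} \prec^* A$, implies by F4 and F3 that $D \prec^* C \cup A$.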

C8. Assume $\emptyset \prec^* A$. It follows from C5 that $A$ can be partitioned into
$B_1$, $C_1$, $D_1$ such that $B_1 \preceq^* C_1 \cup D_1$ and $C_1 \preceq^* B_1 \cup D_1$. If one of these two
$\preceq^*$ is $\sim^*$, the conclusion of C8 holds. Henceforth assume that $B_1 \prec^* C_1 \cup D_1$
and $C_1 \prec^* B_1 \cup D_1$. Then $\emptyset \prec^* D_1$. For definiteness take $B_1 \preceq^* C_1$.
Then C6 $\Rightarrow$ there is a $C^2 \subseteq D_1$ such that $\emptyset \prec^* C^2$ and
$C_1 \cup C^2 \prec^* B_1 \cup (D_1/C^2)$. Hence $\emptyset \prec^* D_1/C^2$ and, by C7, $D_1/C^2$ can be
partitioned into $B^2$ and $D_2$ such that $B^2 \preceq^* D_2 \preceq^* C^2 \cup B^2$. Since $B_1 \preceq^* C_1$,
$B_1 \cup B^2 \preceq^* C_1 \cup D_2 \prec^* C_1 \cup D_2 \cup C^2$, all by F4. Let

$B_2 = B_1 \cup B^2$ and $C_2 = C_1 \cup C^2$.

We then obtain a partition $\{B_2, C_2, D_2\}$ of $A$ for which

1. $B_2 \prec^* C_2 \cup D_2$ and $C_2 \prec^* B_2 \cup D_2$,

2. $B_1 \subseteq B_2$, $C_1 \subseteq C_2$, $D_2 \subseteq D_1$,

3. $D_2 \preceq^* D_1/D_2$.

By repeating this process it follows that there is a sequence $\ldots, \{B_n, C_n,
D_n\}, \ldots$ of three part partitions of $A$ such that, for each $n \ge 1$,

1. $B_n \prec^* C_n \cup D_n$ and $C_n \prec^* B_n \cup D_n$,

2. $B_n \subseteq B_{n+1}$, $C_n \subseteq C_{n+1}$, $D_{n+1} \subseteq D_n$,

3. $D_{n+1} \preceq^* D_n/D_{n+1}$,

so that $\emptyset \prec^* D_n$ for all $n$, and $D_n$ contains two disjoint events ($D_{n+1}$ and
$D_n/D_{n+1}$) each of which is as probable as $D_{n+1}$. Hence, using (3) and C3($\prec^*$),
$(E_1 \prec^* D_{n+1}, E_2 \prec^* D_{n+1}) \Rightarrow E_1 \cup E_2 \prec^* D_n$.

Now for any $G$ with $\emptyset \prec^* G$, $D_n \prec^* G$ for sufficiently large $n$. For
example, if $G \preceq^* D_n$ for all $n$ then, with $\{E_1, \ldots, E_m\}$ for $\emptyset \prec^* G$ as in F5 with
$E_i \prec^* G$ for all $i$, $E_i \prec^* D_n$ for all $i$ so that $E_1 \cup E_2 \prec^* D_{n-1}$,
$E_3 \cup E_4 \prec^* D_{n-1}, \ldots$ and then $\bigcup_1^4 E_i \prec^* D_{n-2}$, $\bigcup_5^8 E_i \prec^* D_{n-2}, \ldots$ and
so forth, so that with $n$ sufficiently large $\bigcup_1^m E_i \prec^* D_1$, or $S \prec^* D_1$, which is
false. In addition, $\emptyset \sim^* \bigcap_{n=1}^{\infty} D_n$, for if $\emptyset \prec^* \bigcap D_n$ then $D_m \prec^* \bigcap D_n$ for
sufficiently large $m$, and this is false since $\bigcap D_n \subseteq D_m$.

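Let

$B = \bigcup_{n=1}^{\infty} B_n$ and $C = \left(\bigcup_{n=1}^{\infty} C_n\right) \cup \left(\bigcap_{n=1}^{\infty} D_n\right)$.

$\{B, C\}$ is a partition of $A$ since $(\bigcup B_n) \cap (\bigcup C_n) = (\bigcup B_n) \cap (\bigcap D_n) =
(\bigcup C_n) \cap (\bigcap D_n) = \emptyset$. To verify $B \sim^* C$ note first that $C \sim^* \bigcup C_n$ since
$\bigcap D_n \sim^* \emptyset$. Suppose that $B \prec^* C$. Then $B \prec^* \bigcup C_n$ and, by C6, there is a
$G \subseteq \bigcup C_n$ for which $\emptyset \prec^* G$ and

$B \cup G \prec^* (\bigcup C_n)/G$.

Since $B \cap G = \emptyset$ and $B_n \preceq^* B$ (since $B_n \subseteq B$), F4 implies

$B_n \cup G \preceq^* B \cup G$.

For large $n$, $D_n \prec^* G$ so that, again by F4,

$B_n \cup D_n \prec^* B_n \cup G$.

Since $D_n \cap (\bigcup C_n) \prec^* G$ for large $n$ and $\bigcup C_n = ((\bigcup C_n)/G) \cup G =
((\bigcup C_n)/D_n) \cup ((\bigcup C_n) \cap D_n)$, it follows by C3($\prec^*$) that for large $n$

$(\bigcup C_n)/G \preceq^* (\bigcup C_n)/D_n$.

Finally, since $(\bigcup C_n)/D_n \subseteq C_n$, C1 $\Rightarrow (\bigcup C_n)/D_n \preceq^* C_n$. This and the four
preceding displayed expressions yield $B_n \cup D_n \prec^* C_n$ by transitivity (for
large $n$) which contradicts $C_n \prec^* B_n \cup D_n$ in (1). Therefore, not $B \prec^* C$.
By a similar proof, not $\bigcup C_n \prec^* (\bigcup B_n) \cup (\bigcap D_n)$ so that not $C \prec^* B$.

C9. This follows from C3($\prec^*$) and C8. ♦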

We  now  complete  the  proof  of  Theorem  14.2. 

Proof  of  (14.2) 

Let  F1-F5  hold.  We  shall  call  a  partition  {Alt . . .  ,  Am}  of  A  a.  u.p.  ( uniform 
partition)  when  0  <  *  A  and  At  Ag  Am,  and  let 

C(r,  2")  =  {A  :A  is  the  union  of  r  events  in  some  2"  part  u.p.  of  S). 

We  shall  establish  (14.2)  through  a  series  of  steps,  each  of  which  proves  a  key 
assertion. 

1. $[A, B \in C(r, 2^n)] \Rightarrow A \sim^* B$. First, if $A, B \in C(1, 2^n)$, and if $A \prec^* B$,
it follows easily from C3($\prec^*$) that $S \prec^* S$. Hence, if $A, B \in C(1, 2^n)$, then
$A \sim^* B$. Therefore, if $A, B \in C(r, 2^n)$, $A \sim^* B$ follows from C4.

2. $[A \in C(r, 2^n), B \in C(r2^m, 2^{n+m})] \Rightarrow A \sim^* B$. First, if $A \in C(1, 2^n)$ and
$B \in C(2^m, 2^{n+m})$, then $A \sim^* B$, for otherwise, by step 1 and C3($\prec^*$) we get
$S \prec^* S$. The desired conclusion follows from C4.

3. $[A \in C(r, 2^n), B \in C(t, 2^m)] \Rightarrow (A \preceq^* B \Leftrightarrow r/2^n \leq t/2^m)$. If $r/2^n = t/2^m$
then $r2^m = t2^n$ and, with $D \in C(r2^m, 2^{n+m})$ it follows from step 2 that
$A \sim^* D$ and $B \sim^* D$, so that $A \sim^* B$. If $r2^m < t2^n$ then, with $D_1 \in
C(r2^m, 2^{n+m})$ and $D_2 \in C(t2^n, 2^{n+m})$ we get $A \sim^* D_1$ and $B \sim^* D_2$. But
surely $D_1 \prec^* D_2$ when $r2^m < t2^n$. Therefore $A \prec^* B$.

4. For $A \subseteq S$ let $k(A, 2^n)$ be the largest integer $r$ (possibly zero) such that
$B \preceq^* A$ when $B \in C(r, 2^n)$, and define

$$P^*(A) = \sup \{k(A, 2^n)/2^n : n = 0, 1, 2, \ldots\}. \tag{14.6}$$

Clearly, $P^*(\emptyset) = 0$, $P^*(S) = 1$, and $P^*(A) \geq 0$ for all $A \subseteq S$. Moreover,

$$A \in C(r, 2^n) \Rightarrow P^*(A) = r/2^n. \tag{14.7}$$

If $A \in C(r, 2^n)$ then, by (14.6), $P^*(A) \geq r/2^n$. If, in fact, $P^*(A) > r/2^n$ then
for some $B \in C(t, 2^m)$ with $r/2^n < t/2^m$, $B \preceq^* A$. But this is impossible by
step 3.

5. $A \preceq^* B \Rightarrow P^*(A) \leq P^*(B)$. This is obvious from (14.6).

6. $P^*$ is finitely additive. Let $A \cap B = \emptyset$. It follows that, for each $n$, there
is a $2^n$ part u.p. of $S$ for which $A_n$ and $B_n$ are unions of elements in this
partition, with $A_n \cap B_n = \emptyset$, $A_n \in C(k(A, 2^n), 2^n)$, $B_n \in C(k(B, 2^n), 2^n)$,
$A_n \preceq^* A$, $B_n \preceq^* B$. Hence $A_n \cup B_n \preceq^* A \cup B$ by C3, and $k(A, 2^n) +
k(B, 2^n) \leq k(A \cup B, 2^n)$. Since, for any $A \subseteq S$, it is easily seen that
$k(A, 2^n)/2^n$ does not decrease as $n$ increases, it follows from Exercise 10.7
that

$$P^*(A) + P^*(B) \leq P^*(A \cup B).$$

If we now define $k^*(A, 2^n)$ as the smallest integer $r$ such that $A \preceq^* B$ when
$B \in C(r, 2^n)$, it readily follows from the fact that $\{r/2^n : r = 0, \ldots, 2^n;
n = 0, 1, \ldots\}$ is dense in $[0, 1]$ that $\inf \{k^*(A, 2^n)/2^n : n = 0, 1, \ldots\} =
\sup \{k(A, 2^n)/2^n : n = 0, 1, \ldots\}$. A proof symmetric to that just completed
then implies that

$$P^*(A \cup B) \leq P^*(A) + P^*(B) \quad\text{when } A \cap B = \emptyset$$

so that $P^*(A \cup B) = P^*(A) + P^*(B)$.

7. $\emptyset \prec^* A \Rightarrow 0 < P^*(A)$. Let $\emptyset \prec^* A$. By F5 there is a partition
$\{A_1, \ldots, A_n\}$ of $S$ for which $A_i \prec^* A$ for each $i$. Then, by step 5, $P^*(A_i) \leq
P^*(A)$. Finite additivity then requires that $P^*(A) > 0$.

8. $A \prec^* B \Rightarrow P^*(A) < P^*(B)$. Suppose $A \prec^* B$. Then, using F5, there is
a $C \subseteq S$ for which $\emptyset \prec^* C$, $C \cap A = \emptyset$, and $C \cup A \prec^* B$. By finite
additivity and step 5, $P^*(C) + P^*(A) \leq P^*(B)$. Since $P^*(C) > 0$ by step 7,
$P^*(A) < P^*(B)$.

Steps  5  and  8  imply  (14.2)  and  it  is  obvious  that  P *  as  defined  here  is  the 
only  probability  measure  on  S  that  satisfies  (14.2).  ♦ 
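
The dyadic construction of step 4 can be imitated numerically. The following is only a sketch under an added assumption that is not part of the proof: an event is summarized by a hidden probability `p`, and a member of C(r, 2^n) by the dyadic number r/2^n, so that the comparison "B is not more probable than A" reduces to r/2^n <= p. The names `k` and `p_star_approx` are illustrative.

```python
from fractions import Fraction

def k(p, n):
    """Largest r in {0, ..., 2**n} with r / 2**n <= p.

    Stands in for k(A, 2^n): the hidden probability p of A is queried only
    through comparisons against dyadic levels r / 2**n (an assumed oracle).
    """
    r = 0
    while r < 2**n and Fraction(r + 1, 2**n) <= p:
        r += 1
    return r

def p_star_approx(p, max_n):
    """Return the sequence k(A, 2^n) / 2^n for n = 0, ..., max_n."""
    return [Fraction(k(p, n), 2**n) for n in range(max_n + 1)]

approx = p_star_approx(Fraction(1, 3), 8)
```

In this model the sequence does not decrease with n and its supremum recovers p, which is the content of (14.6).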

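Proof that $(B \subseteq S, 0 \leq \rho \leq 1) \Rightarrow P^*(C) = \rho P^*(B)$ for some $C \subseteq B$

If $P^*(B) = 0$ the result is obvious. Assume then that $P^*(B) > 0$, and
consider a sequence $\{A_1^1, A_2^1\}, \{A_1^2, \ldots, A_4^2\}, \ldots, \{A_1^n, \ldots, A_{2^n}^n\}, \ldots$ of $2^n$
part u.p.'s of $B$ for which $\{A_{2i-1}^{n+1}, A_{2i}^{n+1}\}$ is a 2 part u.p. of $A_i^n$. For a given $n$
let $m = \sup \{j : P^*(\bigcup_{i=1}^{j} A_i^n) < \rho P^*(B)\}$ so that

$$P^*\Bigl(\bigcup_{i=1}^{m} A_i^n\Bigr) + 2^{-n} P^*(B) \geq \rho P^*(B),$$

and let $k = \inf \{j : P^*(\bigcup_{i=j}^{2^n} A_i^n) < (1 - \rho) P^*(B)\}$ so that

$$P^*\Bigl(\bigcup_{i=k}^{2^n} A_i^n\Bigr) + 2^{-n} P^*(B) \geq (1 - \rho) P^*(B).$$

Let $C_n = \bigcup_{i=1}^{m} A_i^n$ and $D_n = \bigcup_{i=k}^{2^n} A_i^n$ so that $C_1 \subseteq C_2 \subseteq \cdots$, $D_1 \subseteq
D_2 \subseteq \cdots$, $C_n \cap D_n = \emptyset$ for all $n$, and $P^*(C_n) \geq \rho P^*(B) - 2^{-n} P^*(B)$ and
$P^*(D_n) \geq (1 - \rho) P^*(B) - 2^{-n} P^*(B)$ for all $n$. Since $C_n \subseteq \bigcup_n C_n$ and
$D_n \subseteq \bigcup_n D_n$, $\rho P^*(B) \leq P^*(\bigcup C_n)$ and $(1 - \rho) P^*(B) \leq P^*(\bigcup_n D_n)$.
Moreover, $(\bigcup C_n) \cap (\bigcup D_n) = \emptyset$. Hence, by finite additivity, C1, and (14.2),

$$P^*(\textstyle\bigcup C_n) + P^*(\textstyle\bigcup D_n) = P^*((\textstyle\bigcup C_n) \cup (\textstyle\bigcup D_n)) \leq P^*(B)$$

which requires $P^*(\bigcup C_n) = \rho P^*(B)$ and $P^*(\bigcup D_n) = (1 - \rho) P^*(B)$. ♦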




14.3  PROBABILITIES  FROM  PREFERENCES 


This section shows how F1-F5 of Theorem 14.2 follow from P1-P6 and
(14.1), which is

$$A \prec^* B \Leftrightarrow [(x \prec y,\ f = y \text{ on } A,\ f = x \text{ on } A^c,\ g = y \text{ on } B,\ g = x \text{ on } B^c) \Rightarrow f \prec g]. \tag{14.1}$$

If $x \sim y$ for all $x, y \in X$ then $A \prec^* B$ for every $A, B \subseteq S$. P5 clears up this
potential snag. [Savage, who uses a different definition than (14.1), gets
$A \preceq^* B$ for all $A, B \subseteq S$ when P5 is false. His definition is $A \preceq^* B \Leftrightarrow
[(x \prec y, \ldots) \Rightarrow f \preceq g]$. The main difference here is stylistic.]

Since P5 says that $x \prec y$ for some $x, y \in X$, it then follows from (14.1)
and P1 that $\prec^*$ is asymmetric: $A \prec^* B \Rightarrow$ not $B \prec^* A$. Suppose not
$A \prec^* B$ and not $B \prec^* C$. With $x \prec y$ it follows from P4 and (14.1) that
($f = y$ on $A$, $f = x$ on $A^c$, $g = y$ on $B$, $g = x$ on $B^c$, not $f \prec g$) and that
($g = y$ on $B$, $g = x$ on $B^c$, $h = y$ on $C$, $h = x$ on $C^c$, not $g \prec h$), so that,
using P1, ($f = y$ on $A$, $f = x$ on $A^c$, $h = y$ on $C$, $h = x$ on $C^c$, not $f \prec h$),
so that not $A \prec^* C$. Hence (P1, P4, P5) $\Rightarrow$ F3: $\prec^*$ on the set of all subsets
of $S$ is a weak order.

Letting $A = \emptyset$ and $B = S$ in (14.1), $\emptyset \prec^* S$ follows immediately from
the definition of $\prec$ on $X$. $\emptyset \prec^* S$ is F2.

Suppose $A$ is null and ($x \prec y$, $f = y$ on $A$, $f = x$ on $A^c$, $g = x$ on $S$).
Then, since $f = g$ on $A^c$, $f \sim g$. Hence not $A \prec^* \emptyset$. If $A$ is not null and
($x \prec y$, $f = x$ on $S$, $g = y$ on $A$, $g = x$ on $A^c$) then $f \prec g$ given $A$ by P3,
and since $f = g$ on $A^c$, $f \prec g$ by the definition of conditional preference. It
then follows that $\emptyset \prec^* A$. This verifies F1 in the presence of P3.

F4 is implied by P2 and P4. Assume $A \cap C = B \cap C = \emptyset$. If P5 is false
then $A \prec^* B$ and $A \cup C \prec^* B \cup C$ follow. Assume then that $x \prec y$. Let

$f = y$ on $A$, $f = x$ on $A^c$
$g = y$ on $B$, $g = x$ on $B^c$
$f' = y$ on $A \cup C$, $f' = x$ on $(A \cup C)^c$
$g' = y$ on $B \cup C$, $g' = x$ on $(B \cup C)^c$.

Since $f = f'$ and $g = g'$ on $C^c$, and $f = g$ and $f' = g'$ on $C$, P2 says that
$f \prec g \Leftrightarrow f' \prec g'$. If $A \prec^* B$ then $f \prec g$ by (14.1), then $f' \prec g'$, then
$A \cup C \prec^* B \cup C$ by (14.1) and P4. By the reverse procedure $A \cup C \prec^*
B \cup C \Rightarrow A \prec^* B$.

To verify F5 suppose $A \prec^* B$. Take $x \prec y$ by P5. With $f, g$ as in (14.1),
$f \prec g$. By P6 there is a partition $\{C_1, \ldots, C_m\}$ of $S$ such that $f_i \prec g$ when
$f_i = y$ on $C_i$ and $f_i = f$ on $C_i^c$. Since $f_i = y$ on $A \cup C_i$ and $f_i = x$ on
$(A \cup C_i)^c$ and $f_i \prec g$, (14.1) and P4 imply $A \cup C_i \prec^* B$.




Thus, F1-F5 follow from P1-P6 under (14.1). Therefore, by Theorem 14.2,
P1-P6 imply the existence of $P^*$ as specified in (14.2) and (14.3).
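
Definition (14.1) reads probabilities off preferences between two-consequence bets. As an illustration only, the sketch below assumes a toy model that is not in the text: three states with known probabilities and an agent who ranks acts by expected utility. It then checks that the relation obtained from the bets agrees with the underlying probabilities. The names `bet_on` and `less_probable` are hypothetical, and only one pair x, y is used in place of the quantification over all such pairs.

```python
from fractions import Fraction
from itertools import combinations

# Assumed toy model (not from the text): three states with known
# probabilities and an expected-utility agent over two consequences.
P = {"s1": Fraction(1, 2), "s2": Fraction(1, 3), "s3": Fraction(1, 6)}
U = {"x": 0, "y": 1}  # x is strictly less preferred than y

def expected_utility(act):
    return sum(P[s] * U[act[s]] for s in P)

def strictly_prefers(f, g):
    """True when g is strictly preferred to f."""
    return expected_utility(f) < expected_utility(g)

def bet_on(event):
    """Act paying y on the event and x on its complement, as in (14.1)."""
    return {s: ("y" if s in event else "x") for s in P}

def less_probable(A, B):
    """A is less probable than B, read off from the two bets."""
    return strictly_prefers(bet_on(A), bet_on(B))

events = [frozenset(c) for r in range(len(P) + 1) for c in combinations(P, r)]
agrees = all(
    less_probable(A, B) == (sum(P[s] for s in A) < sum(P[s] for s in B))
    for A in events for B in events
)
```

The point of Section 14.3 is the converse direction: the probabilities are not assumed but derived from preferences satisfying P1-P6.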

14.4  UTILITY  FOR  SIMPLE  ACTS 

$P^*$ as specified in (14.2) and (14.3) induces a probability measure $P_f$
on (the set of subsets of) $X$ for each $f \in F$ as follows:

$$P_f(Y) = P^*\{f(s) \in Y\} \quad\text{for each } Y \subseteq X, \tag{14.8}$$

where, as usual, $P^*\{f(s) \in Y\}$ means $P^*(\{s : f(s) \in Y\})$. Let $\mathcal{P}_s$ be the set of all
simple probability measures on $X$ and let $\mathcal{P} = \{P_f : f \in F\}$. With $F$ the set of
all functions on $S$ to $X$ it follows from (14.3) that $\mathcal{P}_s \subseteq \mathcal{P}$.
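
For a finite state space, (14.8) is a pushforward of $P^*$ through the act. The sketch below assumes a dictionary representation of $P^*$ and of acts, which is an illustrative choice and not notation from the text; `induced_measure` is a hypothetical helper name.

```python
from collections import defaultdict
from fractions import Fraction

def induced_measure(p_star, act):
    """Push P* forward through an act f: P_f({x}) = P*({s : f(s) = x})."""
    p_f = defaultdict(Fraction)
    for state, prob in p_star.items():
        p_f[act[state]] += prob
    return dict(p_f)

p_star = {"s1": Fraction(1, 4), "s2": Fraction(1, 4), "s3": Fraction(1, 2)}
f = {"s1": "a", "s2": "b", "s3": "a"}
g = {"s1": "b", "s2": "a", "s3": "a"}
```

Here `f` and `g` differ state by state yet induce the same measure on the consequences, which is the situation Theorem 14.3 addresses.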

Later in this section we shall prove that the three conditions of Theorem
8.2 follow from P1-P6. Before doing that we note that for any $P \in \mathcal{P}_s$ there
may be many different acts in $F$ that have this $P$ as their measure on $X$
induced by $P^*$. Clearly then, if (14.4) is to hold it is absolutely essential to
have $f \sim g$ when $P_f = P_g$.

THEOREM 14.3. (P1-P6; $P_f = P_g$; $P_f, P_g \in \mathcal{P}_s$) $\Rightarrow f \sim g$.

Preparatory to proving this we shall prove two lemmas, the first of which
will be used extensively in later developments.

LEMMA 14.1. (P1, P2, $\{A_1, \ldots, A_n\}$ is a partition of $A$, $f \preceq g$ given $A_i$
for each $i$) $\Rightarrow f \preceq g$ given $A$. (P1, P2, $\{A_1, \ldots, A_n\}$ is a partition of $A$, $f \preceq g$
given $A_i$ for each $i$, $f \prec g$ given $A_i$ for some $i$) $\Rightarrow f \prec g$ given $A$.

LEMMA 14.2. (P1-P4, $A \cap B = \emptyset$, $A \sim^* B$, $f = x$ and $g = y$ on $A$,
$f = y$ and $g = x$ on $B$) $\Rightarrow f \sim g$ given $A \cup B$.

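Proof of Lemma 14.1. Let the hypotheses of the first part hold. Let $f' = f$
and $g' = g$ on $A$, and $f' = g'$ on $A^c$. By (P1, P2), $f \preceq g$ given $A \Leftrightarrow f' \preceq g'$.
For $i = 1, \ldots, n - 1$ let

$f_i = g' = g$ on $\bigcup_{j=1}^{i} A_j$
$f_i = f' = f$ on $\bigcup_{j=i+1}^{n} A_j$
$f_i = f' = g'$ on $A^c$.

Since $f \preceq g$ given $A_i$ for each $i$, (P1, P2) $\Rightarrow f' \preceq f_1, f_1 \preceq f_2, \ldots, f_{n-1} \preceq g'$
and hence $f' \preceq g'$. If $f \prec g$ given $A_i$ for some $i$ also then one $\preceq$ in the sequence
is $\prec$ and hence $f' \prec g'$, or $f \prec g$ given $A$. ♦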




Proof of Lemma 14.2. Let the hypotheses of the lemma hold. Let

$f' = y$ on $B$, $f' = x$ on $B^c$
$g' = y$ on $A$, $g' = x$ on $A^c$.

If $x \prec y$ then $f' \sim g'$ by $A \sim^* B$, (14.1), and P4. Since $f' = g' = x$ on
$(A \cup B)^c$, P1 $\Rightarrow f' \sim g'$ given $(A \cup B)^c$. Then $f' \sim g'$ given $A \cup B$ for otherwise,
by Lemma 14.1, either $f' \prec g'$ or $g' \prec f'$. Since $f = f'$ and $g = g'$ on
$A \cup B$, (P1, P2) $\Rightarrow f \sim g$ given $A \cup B$. If $y \prec x$ the conclusion is the same.

Finally, suppose $x \sim y$. If $A$ ($B$) is null then $f \sim g$ given $A$ ($B$) follows from
the definitions of conditional preference and null events. If $A$ ($B$) is not null
then $f \sim g$ given $A$ ($B$) follows directly from P3. Hence, by Lemma 14.1,
$f \sim g$ given $A \cup B$. ♦

Proof of Theorem 14.3. Let P1-P6 hold. We are to prove that if the $x_i$ are
all different and if

$f = x_i$ on $A_i$, $g = x_i$ on $B_i$ for $i = 1, \ldots, n$
$0 < P^*(A_i) = P^*(B_i)$ for $i = 1, \ldots, n$, and $\sum_i P^*(A_i) = 1$

then $f \sim g$. Under these hypotheses $S/\bigcup A_i$ and $S/\bigcup B_i$ are null events
(Exercise 17). Hence, with $f' = f$ on $\bigcup A_i$, $f' = x_1$ on $S/\bigcup A_i$, $g' = g$ on
$\bigcup B_i$ and $g' = x_1$ on $S/\bigcup B_i$, $f' \sim f$ and $g' \sim g$ so that $f \sim g \Leftrightarrow f' \sim g'$.
Thus it will suffice to prove that $f \sim g$ when $\{A_1, \ldots, A_n\}$ and $\{B_1, \ldots, B_n\}$
are partitions of $S$.

$f \sim g$ if $n = 1$. Using induction on $n > 1$ we shall "eliminate" $x_n$. Thus,
assume the theorem is true for $n - 1$, and with $n > 1$ let

$$A = A_n \cap B_n^c \quad\text{and}\quad B = B_n \cap A_n^c$$

so that $A \cap B = \emptyset$ and $P^*(A) = P^*(B)$, the latter by $P^*(A_n \cap B_n^c) +
P^*(A_n \cap B_n) = P^*(A_n) = P^*(B_n) = P^*(B_n \cap A_n^c) + P^*(B_n \cap A_n)$. Let
$D_i = B \cap A_i$ for $i = 1, \ldots, n - 1$ so that $\{D_1, \ldots, D_{n-1}\}$ is a partition of $B$.
Then, by (14.3), there is a partition $\{C_1, \ldots, C_{n-1}\}$ of $A$ for which

$$P^*(C_i) = P^*(D_i) \qquad i = 1, \ldots, n - 1. \tag{14.9}$$

Let $f_0 = f$ and define $f_1, \ldots, f_{n-1}$ recursively thus: $f_i = f_{i-1}$ on $(C_i \cup D_i)^c$,
$f_i = x_n$ on $D_i$, $f_i = x_i$ on $C_i$. Figure 14.1 illustrates this along with $g$. By
(14.9) and Lemma 14.2, $f_i \sim f_{i-1}$ given $C_i \cup D_i$ for $i = 1, \ldots, n - 1$.
Since $f_i = f_{i-1}$ on $(C_i \cup D_i)^c$, $f_i \sim f_{i-1}$ given $(C_i \cup D_i)^c$. Therefore, by
Lemma 14.1, $f_i \sim f_{i-1}$ for $i = 1, \ldots, n - 1$ so that $f \sim f_{n-1}$.




[Figure 14.1. The acts $f, f_1, f_2, \ldots, f_{n-1}$ and $g$, tabulated by the consequence
each assigns on $C_1, \ldots, C_{n-1}$ (a partition of $A = A_n \cap B_n^c$), on
$D_1, \ldots, D_{n-1}$ (a partition of $B = A_n^c \cap B_n$), and on $A_n \cap B_n$.]

It remains to show that $f_{n-1} \sim g$. With $B_n = B \cup (A_n \cap B_n)$ let

$f' = f_{n-1}$ on $B_n^c$, $g' = g$ on $B_n^c$
$f' = x_{n-1}$ on $B_n$, $g' = x_{n-1}$ on $B_n$.

Then, as shown by Figure 14.1, the only consequences that can occur with
$f'$ and $g'$ are $x_1, \ldots, x_{n-1}$; $x_n$ has been eliminated. By (14.9) and Figure
14.1, $P^*\{f_{n-1}(s) = x_i\} = P^*\{f(s) = x_i\}$. Hence

$P^*\{f' = x_i\} = P^*\{g' = x_i\} = P^*(B_i)$ for $i = 1, \ldots, n - 2$,
$P^*\{f' = x_{n-1}\} = P^*\{g' = x_{n-1}\} = P^*(B_{n-1}) + P^*(B_n)$

which fits our initial format with $n$ replaced by $n - 1$. Thus $f' \sim g'$ by the
induction hypothesis. Then, since $f' \sim g'$ given $B_n$, Lemma 14.1 requires
$f' \sim g'$ given $B_n^c$. Then, since $f_{n-1} = f'$ and $g = g'$ on $B_n^c$, $f_{n-1} \sim g$ given
$B_n^c$. Also, since $f_{n-1} = g$ on $B_n$, $f_{n-1} \sim g$ given $B_n$. Hence $f_{n-1} \sim g$ by Lemma
14.1. ♦

The  Axioms  of  Chapter  8 

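Defining $\prec$ on $\mathcal{P}_s$ by

$$P \prec Q \Leftrightarrow f \prec g \text{ whenever } P_f = P \text{ and } P_g = Q, \tag{14.10}$$

P1 and Theorem 14.3 imply that $\prec$ on $\mathcal{P}_s$ is a weak order. The second and
third conditions of Theorem 8.2 follow from the next two lemmas.

LEMMA 14.3. ($P, Q, R \in \mathcal{P}_s$, $0 < \alpha < 1$, P1-P6) $\Rightarrow$ ($P \prec Q \Leftrightarrow \alpha P +
(1 - \alpha)R \prec \alpha Q + (1 - \alpha)R$).

LEMMA 14.4. ($P, Q \in \mathcal{P}_s$, $f \in F$, $P \prec Q$, $P \preceq f \preceq Q$, P1-P6) $\Rightarrow$ there is
one and only one $\alpha \in [0, 1]$ such that $f \sim \alpha P + (1 - \alpha)Q$.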




Of course $f \prec P \Leftrightarrow f \prec g$ when $P_g = P$, with similar definitions for $f \preceq P$,
$f \sim P, \ldots$. Theorem 14.3 guarantees no ambiguity here as long as $P \in \mathcal{P}_s$.
The fact that Lemma 14.4 holds for any $f \in F$ will be used in the next section.

Proof of Lemma 14.3. Throughout this proof and the proof of Lemma 14.4
we shall take $\{x_1, \ldots, x_n\} = \{x : P(x) > 0\}$ with $P(x_i) = \alpha_i$, $\{y_1, \ldots, y_m\} =
\{x : Q(x) > 0\}$ with $Q(y_j) = \beta_j$, so that $\sum_i \alpha_i = \sum_j \beta_j = 1$, and let $w$ be a
most preferred consequence in $\{x_1, \ldots, x_n, y_1, \ldots, y_m\}$. $A(\alpha)$ will denote an
event in $S$ for which $P^*(A(\alpha)) = \alpha$. Equation (14.3) will be used freely to construct
events with various probabilities.

For Lemma 14.3 we shall consider $f \prec g$ given $D(\alpha)$ with $D(\alpha > 0) \subseteq S$,
$P^*(f = x_i \mid D(\alpha)) = \alpha_i$ $(i = 1, \ldots, n)$ and $P^*(g = y_j \mid D(\alpha)) = \beta_j$ $(j = 1, \ldots, m)$. In
view of Theorem 14.3 and the first paragraph of its proof, $f \prec g$ given
$D(1) \Leftrightarrow P \prec Q$, and $f \prec g$ given $D(\alpha) \Leftrightarrow \alpha P + (1 - \alpha)R \prec \alpha Q + (1 - \alpha)R$.
(When $\alpha < 1$ let $f = g$ on $D(\alpha)^c$ with probabilities on $D(\alpha)^c$ equal to $(1 - \alpha)$
times the positive $R(x)$.) To prove the lemma we shall show that if $f \prec g$
given $D(\alpha)$ for one $\alpha \in (0, 1]$ then $f \prec g$ given $D(\alpha)$ for every $\alpha \in (0, 1]$.

Thus suppose that $f \prec g$ given $D(\gamma)$. Then, by considering $n$ part uniform
partitions of $D(\gamma)$, it follows from Lemma 14.1 that $f \prec g$ given $D(\gamma/n)$ for
every positive integer $n$. Moreover, $f \prec g$ given $D(r\gamma)$ for every rational
number $r \in (0, 1/\gamma]$.

Let $0 < \beta < 1$ be such that $f \prec g$ given $D(\beta)$. Let

$f^* = f$ and $g^* = g$ on $D(\beta)$, $f^* = g^* = w$ on $D(\beta)^c$

so that $f^* \prec g^*$. Then, using P6 $m$ times (once for each $y_j$) and Lemma 14.1
if necessary (so as not to exhaust all of $1 - \beta$ before the $m$ uses of P6 are
completed), we obtain $g'$ with $f^* \prec g'$ and

$g' = g^* = g$ on $D(\beta)$
$g' = y_j$ on $C_j(\lambda_j)$; $\lambda_j > 0$, $C_j(\lambda_j) \subseteq D(\beta)^c$, $C_j \cap C_k = \emptyset$ for $j \neq k$ $(j = 1, \ldots, m)$
$g' = w$ on $[D(\beta) \cup (\bigcup_j C_j(\lambda_j))]^c$.

Taking $C_j(\delta\beta_j) \subseteq C_j(\lambda_j)$ for $j = 1, \ldots, m$ with $\delta > 0$, let $g^\circ = g'$ except
that $g^\circ = w$ on $C_j(\lambda_j)/C_j(\delta\beta_j)$. Since $y_j \preceq w$, Lemma 14.1 implies that
$g' \preceq g^\circ$, with

$g^\circ = g$ on $D(\beta + \delta) = D(\beta) \cup (\bigcup_j C_j(\delta\beta_j))$
$g^\circ = w$ on $D(\beta + \delta)^c$.




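Also take $f^\circ = f^*$ except that $f^\circ = x_i$ on $E_i(\delta\alpha_i)$ where the $E_i$ form a partition
of $\bigcup_j C_j(\delta\beta_j)$. Since $x_i \preceq w$, $f^\circ \preceq f^*$ by Lemma 14.1 with

$f^\circ = f$ on $D(\beta + \delta)$
$f^\circ = w$ on $D(\beta + \delta)^c$.

Then $f^\circ \prec g^\circ$ since $f^\circ \preceq f^* \prec g' \preceq g^\circ$, and hence $f \prec g$ given $D(\beta + \delta)$.
Since this holds for all $\delta$ in some interval $(0, \epsilon]$, it follows from this and the
preceding paragraph that $f \prec g$ given $D(\alpha)$ for all $\alpha \in (0, 1)$. Also, since
$f \prec g$ given $D(1/2)$, Lemma 14.1 gives $f \prec g$ given $D(1)$. ♦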

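Proof of Lemma 14.4. As in the proof of C1 of Theorem 8.3, ($P, Q \in \mathcal{P}_s$,
$P \prec Q$, $0 \leq \alpha < \beta \leq 1$) $\Rightarrow \beta P + (1 - \beta)Q \prec \alpha P + (1 - \alpha)Q$ follows readily
from Lemma 14.3. Thus, under the hypotheses of Lemma 14.4 there is
one and only one $\alpha \in [0, 1]$ such that

$$\beta P + (1 - \beta)Q \prec f \quad\text{if } \beta > \alpha \tag{14.11}$$

$$f \prec \beta P + (1 - \beta)Q \quad\text{if } \beta < \alpha. \tag{14.12}$$

Clearly, only $\alpha$ can satisfy $f \sim \alpha P + (1 - \alpha)Q$.

Suppose then that $\alpha P + (1 - \alpha)Q \prec f$. This requires $\alpha > 0$. Let

$g = x_i$ on $D(\alpha\alpha_i)$, $i = 1, \ldots, n$
$g = y_j$ on $D((1 - \alpha)\beta_j)$, $j = 1, \ldots, m$

where $\{D(\alpha\alpha_1), \ldots, D((1 - \alpha)\beta_m)\}$ is a partition of $S$. Then $P_g = \alpha P +
(1 - \alpha)Q$. Hence $g \prec f$ by Theorem 14.3 and P1. Then by repeated uses of
P6 obtain $g' \prec f$ where $g' = g$ except that $g' = w$ on $C_i(\gamma_i > 0) \subseteq D(\alpha\alpha_i)$
for $i = 1, \ldots, n$. With $\beta < \alpha$ and $\alpha - \beta$ small, take $C_i(\gamma_i' > 0) \subseteq C_i(\gamma_i)$
with $\gamma_i' = (\alpha - \beta)\alpha_i$ and let $g^\circ = g'$ except that $g^\circ = x_i$ on $C_i(\gamma_i)/C_i(\gamma_i')$. By
Lemma 14.1, $g^\circ \preceq g'$ with

$g^\circ = x_i$ on $D(\beta\alpha_i) \subseteq D(\alpha\alpha_i)$, $i = 1, \ldots, n$
$g^\circ = y_j$ on $D((1 - \alpha)\beta_j)$, $j = 1, \ldots, m$
$g^\circ = w$ on $D(\alpha - \beta)$.

Now change $g^\circ$ to $h$ by partitioning $D(\alpha - \beta)$ into $\{D(\beta_1(\alpha - \beta)), \ldots,
D(\beta_m(\alpha - \beta))\}$ and replacing $w$ on $D(\beta_j(\alpha - \beta))$ by $y_j$. By Lemma 14.1,
$h \preceq g^\circ$, so that by transitivity $h \prec f$. But $P_h = \beta P + (1 - \beta)Q$ by construction
and since $\beta < \alpha$ we have obtained a contradiction to (14.12). Hence
$\alpha P + (1 - \alpha)Q \prec f$ is false. Similarly, $f \prec \alpha P + (1 - \alpha)Q$ is false for this
leads to a contradiction of (14.11). Hence $f \sim \alpha P + (1 - \alpha)Q$. ♦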
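
Once an expected-utility scale is available (which the text establishes only afterwards, in Theorem 14.4), the unique weight of Lemma 14.4 has a closed form. The sketch below is an illustration under that added assumption and is not the argument of the proof; `indifference_weight` is a hypothetical name.

```python
from fractions import Fraction

def indifference_weight(eu_p, eu_q, eu_f):
    """Solve alpha*eu_p + (1 - alpha)*eu_q = eu_f for alpha.

    Assumes eu_p < eu_q and eu_p <= eu_f <= eu_q, mirroring the hypotheses
    of Lemma 14.4 once expected utilities are available.
    """
    if not (eu_p < eu_q and eu_p <= eu_f <= eu_q):
        raise ValueError("need eu_p < eu_q and eu_p <= eu_f <= eu_q")
    return Fraction(eu_q - eu_f) / Fraction(eu_q - eu_p)

alpha = indifference_weight(Fraction(1, 5), Fraction(4, 5), Fraction(1, 2))
```

The weight lies in [0, 1], and it is unique because the mixture's expected utility is strictly decreasing in alpha when the first measure is strictly less preferred.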

In  view  of  the  results  of  this  section  and  those  of  Chapter  8  we  can  state  the 
following  theorem,  in  which  P*  is  as  given  by  (14.2)  and  (14.3)  through  (14.1). 



THEOREM 14.4. P1-P6 imply that there is a real-valued function $u$ on $X$
such that

$$f \prec g \Leftrightarrow E[u(f(s)), P^*] < E[u(g(s)), P^*], \quad\text{for all } P_f, P_g \in \mathcal{P}_s, \tag{14.13}$$

and when $u$ satisfies this representation it is unique up to a positive linear
transformation.

In  the  rest  of  this  chapter,  u  is  assumed  to  satisfy  (14.13). 
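
The uniqueness clause of Theorem 14.4 can be checked on a small example. The sketch below assumes simple measures stored as dictionaries and arbitrary illustrative utility values; it shows that replacing $u$ by $4u + 7$ leaves the ranking in (14.13) unchanged. The helper names are hypothetical.

```python
from fractions import Fraction

def expected_utility(measure, u):
    """E(u, P) for a simple measure given as {consequence: probability}."""
    return sum(prob * u[x] for x, prob in measure.items())

def ranks_below(p, q, u):
    """True when P is strictly less preferred than Q under (14.13)."""
    return expected_utility(p, u) < expected_utility(q, u)

u = {"a": Fraction(0), "b": Fraction(3, 5), "c": Fraction(1)}
v = {x: 4 * val + 7 for x, val in u.items()}  # positive linear transform of u

P = {"a": Fraction(1, 2), "c": Fraction(1, 2)}
Q = {"b": Fraction(1)}
```

With these numbers the expected utilities under `u` are 1/2 for `P` and 3/5 for `Q`, and the same strict ranking holds under `v`.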

14.5  UTILITIES  ARE  BOUNDED 

In proving that $u$ on $X$ is bounded, we shall use the following lemma,
whose proof follows easily from P7.

LEMMA 14.5. (P1, P2, P7, $x \preceq f$ given $A$ and $x \preceq g$ given $A$ for every
$x \in X$) $\Rightarrow f \sim g$ given $A$. (P1, P2, P7, $f \preceq x$ given $A$ and $g \preceq x$ given $A$ for
every $x \in X$) $\Rightarrow f \sim g$ given $A$.

In the proof of the following theorem $\sup T = \infty$ means that $T$ is a set of
real numbers and, if $c \in \mathrm{Re}$, $t > c$ for some $t \in T$; that is, $T$ is unbounded
above. In addition, when $f$ is such that $P^*\{u(f(s)) > d\} = 1$
for some number $d$, $E(u, P_f) = \infty$ means that $\sup \{E[\inf \{u(f(s)), c\}, P^*] : c \in
\mathrm{Re}\} = \infty$.

THEOREM 14.5. (P1-P7) $\Rightarrow u$ on $X$ is bounded.

Proof. Let P1-P7 hold and suppose that $u$ on $X$ is unbounded above.
Using (14.3) construct a sequence $B_1, B_2, \ldots$ of disjoint events in $S$ with
$P^*(B_n) = 2^{-n}$ for $n = 1, 2, \ldots$. If $\bigcup_{n=1}^{\infty} B_n$ does not exhaust $S$, add $S/\bigcup B_n$
to $B_1$. Take $u(x_n) \geq 2^n$ for each $n$ and let

$f = x_n$ on $B_n$, $n = 1, 2, \ldots$

so that $E[u(f(s)), P^*] = \infty$ since

$$E[\inf \{u(f(s)), 2^n\}, P^*] \geq \sum_{i=1}^{n} P^*(B_i) u(x_i) \geq \sum_{i=1}^{n} 2^{-i} 2^{i} = n$$

for $n = 1, 2, \ldots$. Let $x$ be any consequence. Then, for some $y \in \{x_1, x_2, \ldots\}$,

$$u(x) < E[\inf \{u(f(s)), u(y)\}, P^*]. \tag{14.14}$$

Let $f' = f$ on $\{s : f(s) \preceq y\}$ and $f' = y$ on $\{s : y \prec f(s)\}$. Then $P_{f'} \in \mathcal{P}_s$ and
$E[u(f'(s)), P^*] = E[\inf \{u(f(s)), u(y)\}, P^*]$ so that, by Theorem 14.4 and
(14.14), $x \prec f'$. But $f' \preceq f$ by Lemma 14.1 since, by P7, $f' \preceq f$ given
$\{s : y \prec f(s)\}$. Hence $x \prec f$. Therefore $x \prec f$ for every $x$.



Next, let $z$ be such that $u(x_1) < u(z)$. Let $g = z$ on $B_1$ and $g = f$ on $B_1^c$.
As in the preceding paragraph, $x \prec g$ for every $x$, so that $f \sim g$ by Lemma
14.5. But $f \prec g$ given $B_1$ since $x_1 \prec z$ and $P^*(B_1) > 0$, and $f \sim g$ given $B_1^c$
since $f = g$ on $B_1^c$. Hence $f \prec g$ by Lemma 14.1, a contradiction. Hence $u$ is
bounded above. A symmetric proof shows that $u$ is bounded below. ♦
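
The divergence used in the proof can be checked with exact arithmetic. The sketch below assumes the boundary case $u(x_i) = 2^i$ and cuts the state space off after a finite number of events (the parameter `horizon`, an assumption introduced only to keep the sum finite); `truncated_expectation` is a hypothetical name.

```python
from fractions import Fraction

def truncated_expectation(n, horizon):
    """Sum of 2**-i * min(2**i, 2**n) over i = 1, ..., horizon.

    Models E[inf{u(f(s)), 2^n}, P*] for the act with P*(B_i) = 2**-i and
    u(x_i) = 2**i exactly, ignoring the states beyond `horizon`.
    """
    return sum(Fraction(min(2**i, 2**n), 2**i) for i in range(1, horizon + 1))

values = [truncated_expectation(n, 40) for n in range(1, 11)]
```

Each of the first n terms contributes exactly 1, so the truncated expectation is at least n and grows without bound as the truncation level rises.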

14.6  UTILITY  FOR  ALL  ACTS 

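To establish $f \prec g \Leftrightarrow E(u, P_f) < E(u, P_g)$ for all acts we shall first prove
two lemmas.

With $P \in \mathcal{P}_s$, $g = P$ on $A$ means that $P^*\{s \in A \text{ and } g(s) = x\} = P^*(A)P(x)$
for all $x \in X$. We define $f \preceq P$ given $A \Leftrightarrow f \preceq g$ given $A$ for every $g = P$ on
$A$. $P \preceq f$ given $A$ is similarly defined, and $f \sim P$ given $A$ and $f \prec P$ given $A$
are defined in the usual way. Note that, by Theorem 14.3, if $f \preceq g$ given $A$
for one $g = P \in \mathcal{P}_s$ on $A$, then $f \preceq h$ given $A$ for every $h = P$ on $A$. If $A$ is
null, $g = P$ on $A$ for every $g$.

LEMMA 14.6. (P1-P7, $A \neq \emptyset$, $f \preceq x$ given $A$, $u(f(s)) < c$ for all $s \in A$) $\Rightarrow$
there is a $P \in \mathcal{P}_s$ for which $f \preceq P$ given $A$ and $E(u, P) \leq c$. (P1-P7, $A \neq \emptyset$,
$x \preceq f$ given $A$, $c < u(f(s))$ for all $s \in A$) $\Rightarrow$ there is a $P \in \mathcal{P}_s$ for which $P \preceq f$
given $A$ and $c \leq E(u, P)$.

LEMMA 14.7. (P1-P7, $\{B_1, \ldots, B_n\}$ is a partition of $S$, $u(f(s)) < c_i$ for
all $s \in B_i$ $(i = 1, \ldots, n)$, $P \in \mathcal{P}_s$, $P \preceq f$) $\Rightarrow E(u, P) \leq \sum_{i=1}^{n} P^*(B_i)c_i$. (P1-P7,
$\{B_1, \ldots, B_n\}$ is a partition of $S$, $c_i < u(f(s))$ for all $s \in B_i$ $(i = 1, \ldots, n)$,
$P \in \mathcal{P}_s$, $f \preceq P$) $\Rightarrow \sum_{i=1}^{n} P^*(B_i)c_i \leq E(u, P)$.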


It will suffice to prove the first part of each lemma. In each proof the
hypotheses of the first part are assumed to hold.

Proof of Lemma 14.6. If $u(x) \leq c$ let $P(x) = 1$. Then $f \preceq P$ given $A$ by
hypothesis and $E(u, P) = u(x) \leq c$. Henceforth suppose that $c < u(x)$. Let
$y$ be any consequence for which $u(y) \leq c$, as assured by $A \neq \emptyset$ and
$u(f(s)) < c$ for all $s \in A$. Let $P$ be the unique combination of $x$ and $y$ for
which $E(u, P) = c$. If $A$ is null then $f \preceq P$ given $A$ and the proof is complete.
Henceforth assume that $P^*(A) > 0$.

Fix $t \in A$. Let $g = P$ on $A$ and $g = f(t)$ on $A^c$. Since $u(f(t)) < c$,

$$u(f(t)) = P^*(A)u(f(t)) + P^*(A^c)u(f(t)) < P^*(A)E(u, P) + P^*(A^c)u(f(t)) = E[u(g(s)), P^*]$$

so that, by Theorem 14.4, $f(t) \prec g$. Hence $f(t) \prec g$ given $A$. Since this holds
for each $t \in A$, P7 implies that $f \preceq g$ given $A$. Since $g = P$ on $A$, $f \preceq P$ given
$A$. ♦

Proof of Lemma 14.7. Suppose the conclusion is false for some $f$ and $P$ so
that $\sum P^*(B_i)c_i < E(u, P)$. Since this can't hold if $P$ is confined with
probability 1 to worst consequences, it follows that there is a $Q \in \mathcal{P}_s$ for which
$\sum P^*(B_i)c_i < E(u, Q)$ and $Q \prec P \preceq f$. Hence, if the lemma is true when its
$P \preceq f$ hypothesis is replaced by $P \prec f$ then the original lemma must be true.
Thus, it will suffice to show that if P1-P7 hold, if $\{B_1, \ldots, B_n\}$ is a partition
of $S$ and if

1. $u(f(s)) < c_i$ for all $s \in B_i$, $i = 1, \ldots, n$, and

2. $P \in \mathcal{P}_s$ and $P \prec f$,

then $E(u, P) \leq \sum_{i=1}^{n} P^*(B_i)c_i$.

To prove this we show first that $f$ can be modified, if necessary, so that (1)
and (2) hold for the modified $f$ and, for each $i$, there is a $y_i$ such that modified
$f \preceq y_i$ given $B_i$. If there is a $y_i$ such that $f \preceq y_i$ given $B_i$, we cease to worry
about this $i$. On the other hand, suppose $x \prec f$ given $B_i$ for every $x \in X$.
Then $B_i$ can't be null so that $P^*(B_i) > 0$. For this $B_i$ take $y \prec z$ and $u(y) < c_i$.
With $P \prec f$ by (2), it follows from P6 that there is a non-null $A \subseteq B_i$ for
which $P \prec f'$ when $f' = f$ except on $A$ where $f' = y$. Let $f^* = f$ except on $A$
where $f^* = z$. Since $y \prec z$, $f' \prec f^*$ given $A$. Hence $f' \prec f^*$ given $B_i$ by
Lemma 14.1. It cannot be true that $x \preceq f'$ given $B_i$ for every $x \in X$ for
otherwise, by Lemma 14.5, $f' \sim f^*$ given $B_i$, a contradiction. Hence there
is a $y_i \in X$ such that $f' \prec y_i$ given $B_i$. Since (1) and (2) hold for $f'$ we see
that, by considering each $i$, we obtain an act $g$ that satisfies

1. $u(g(s)) < c_i$ for all $s \in B_i$, $i = 1, \ldots, n$,

2. $P \in \mathcal{P}_s$ and $P \prec g$,

3. There is a $y_i \in X$ for which $g \preceq y_i$ given $B_i$, $i = 1, \ldots, n$.

Given such a $g$, Lemma 14.6 implies that, for each $i$, there is a $Q_i \in \mathcal{P}_s$ such
that $g \preceq Q_i$ given $B_i$ and $E(u, Q_i) \leq c_i$. Let $h = Q_i$ on $B_i$ for $i = 1, \ldots, n$.
Then, by Lemma 14.1, $g \preceq h$ so that $P \prec h$. Since $P_h = \sum_i P^*(B_i)Q_i$,
Theorem 14.4 implies that $E(u, P) < E[u(h(s)), P^*]$. Since $E[u(h(s)), P^*] =
\sum P^*(B_i)E(u, Q_i) \leq \sum P^*(B_i)c_i$, $E(u, P) < \sum P^*(B_i)c_i$. ♦

Expected Utility for All Acts

THEOREM 14.6. P1-P7 $\Rightarrow$ (14.4).

Proof. By an appropriate positive linear transformation of $u$, we use P5
and Theorem 14.5 to specify

$$\inf \{u(x) : x \in X\} = 0, \quad \sup \{u(x) : x \in X\} = 1.$$




Each act in $F$ falls into exactly one of the following classes:

1. $f$ is big $\Leftrightarrow x \prec f$ for every $x \in X$,

2. $f$ is little $\Leftrightarrow f \prec x$ for every $x \in X$,

3. $f$ is normal $\Leftrightarrow x \preceq f \preceq y$ for some $x, y \in X$.

Suppose first that $f$ is normal. Lemma 14.4 guarantees that there is a $P \in \mathcal{P}_s$
such that $P \sim f$. Divide $S$ into $A_1 = \{s : 0 \leq u(f(s)) \leq 1/n\}$, $A_i = \{s : (i - 1)/n
< u(f(s)) \leq i/n\}$ for $i = 2, \ldots, n$. Some of the $A_i$ may be empty. By the
definition of expectation (Definition 10.12, Exercise 10.16), $\sum_i P^*(A_i)(i - 1)/n
\leq E[u(f(s)), P^*] \leq \sum_i P^*(A_i)i/n$. Also, by Lemma 14.7, $\sum_i P^*(A_i)(i -
1 - \epsilon)/n \leq E(u, P) \leq \sum_i P^*(A_i)(i + \epsilon)/n$ for any $\epsilon > 0$. Letting $n$ get large
it follows that

$$E[u(f(s)), P^*] = E(u, P) \quad\text{when } f \sim P,\ P \in \mathcal{P}_s. \tag{14.15}$$
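
The squeeze behind (14.15) can be seen on a finite example. The sketch below assumes a finite list of (probability, utility) pairs standing in for $P^*$ and $u(f(s))$, with utilities scaled to [0, 1] as in the proof; `dyadic_bounds` is a hypothetical name, and the cells are those of the partition $A_1, \ldots, A_n$ above.

```python
from fractions import Fraction
import math

def dyadic_bounds(outcomes, n):
    """Lower and upper sums over A_i = {(i-1)/n < u <= i/n}, with 0 in A_1.

    `outcomes` is a list of (probability, utility) pairs with utilities
    in [0, 1]; this finite list is an assumed stand-in for P* and u(f(s)).
    """
    lower = upper = Fraction(0)
    for prob, util in outcomes:
        i = max(1, math.ceil(util * n))  # index of the cell A_i holding util
        lower += prob * Fraction(i - 1, n)
        upper += prob * Fraction(i, n)
    return lower, upper

outcomes = [(Fraction(1, 2), Fraction(1, 3)),
            (Fraction(1, 4), Fraction(0)),
            (Fraction(1, 4), Fraction(9, 10))]
mean = sum(p * u for p, u in outcomes)
lo, hi = dyadic_bounds(outcomes, 100)
```

The two sums bracket the expectation and differ by exactly 1/n, so both converge to it as n grows.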

Suppose next that $f$ is big. By Lemma 14.5, all big acts are indifferent. We
shall prove that

$f$ is big $\Rightarrow u(x) < 1$ for all $x$, $P^*\{u(f(s)) \geq 1 - \epsilon\} = 1$
for $\epsilon > 0$, $E[u(f(s)), P^*] = 1$.

With $f$ big suppose first that $u(w) = 1$ for $w \in X$. Take $x \prec w$, using P5.
Let $A = \{s : u(f(s)) < 1\}$, $A^c = \{s : u(f(s)) = 1\}$. Then, using P7 if $A$ is not
null, as in the final part of the proof of Lemma 14.6, it follows that $f \preceq w$
given $A$. [Suppose that $w \prec f$ given $A^c$ (requiring $A^c$ to be non-null). Then, by
P6, there is a non-null $B \subseteq A^c$ with $w \prec f'$ given $A^c$ and $f' = f$ except on $B$
where $f' = x$. Let $f'' = f$ except on $B$ where $f'' = w$. Then $f' \prec f''$ given $A^c$
by Lemma 14.1. But then, using Lemma 14.5, $f' \sim f''$ given $A^c$, a
contradiction.] Hence $f \preceq w$ given $A^c$ so that $f \preceq w$ by Lemma 14.1. But $f \preceq w$
contradicts $f$'s bigness. Hence $f$ is big $\Rightarrow u(x) < 1$ for all $x \in X$.

Suppose next that for big $f$ there is an $\epsilon > 0$ for which $P^*\{u(f(s)) \geq
1 - \epsilon\} < 1$. Then, with $A = \{s : u(f(s)) < 1 - \epsilon\}$, $P^*(A) > 0$. It follows
from the preceding paragraph that we can select $y, z \in X$ so that

$$1 - \epsilon < u(y) < u(z) < 1.$$

Let $f' = f'' = f$ except on $A$ where $f' = y$ and $f'' = z$. Then, since $u(f(s)) <
u(y)$ for all $s \in A$, $f \preceq y$ given $A$. This leads to $f \preceq f' \prec f''$. But since $f$ is big
$f'$ is then big also and hence $f' \sim f''$ by Lemma 14.5, a contradiction.
Therefore $P^*\{u(f(s)) \geq 1 - \epsilon\} = 1$ for every $\epsilon > 0$. Therefore $E[u(f(s)), P^*] \geq
1 - \epsilon$ for every $\epsilon > 0$ and, since $E$ can't exceed 1 (Exercise 10.22a),
$E[u(f(s)), P^*] = 1$.

By a symmetric proof for little acts it follows that

$f$ is little $\Rightarrow 0 < u(x)$ for all $x$, $P^*\{u(f(s)) \leq \epsilon\} = 1$
for $\epsilon > 0$, $E[u(f(s)), P^*] = 0$,

and, by Lemma 14.5, all little acts are indifferent to each other.




(14.4) follows readily from Theorem 14.4, Lemma 14.5, the fact that
every normal act is indifferent to some $P \in \mathcal{P}_s$, and from (14.15) and the
implications for big and little acts. ♦

14.7  SUMMARY 

Savage's axioms for expected utility apply $\prec$ to the set $F$ of all functions
on $S$ to $X$ (states to consequences). When $\prec^*$ (is less probable than) is
defined on the basis of $\prec$ in an appropriate way, his first six axioms imply
that there is a probability measure $P^*$ on $S$ that satisfies $A \prec^* B \Leftrightarrow P^*(A) <
P^*(B)$, for all $A, B \subseteq S$, and, when this holds, $(B \subseteq S, 0 \leq \rho \leq 1) \Rightarrow
P^*(C) = \rho P^*(B)$ for some $C \subseteq B$, and $P^*$ is unique. This latter property
implies that the set $\{P_f : f \in F\}$ of probability measures on $X$ induced by $P^*$
on $S$ includes the set $\mathcal{P}_s$ of all simple measures on $X$. By showing that axioms
similar to those of Chapter 8 follow for $\prec$ on $\mathcal{P}_s$, we obtain an expected-utility
representation for $\mathcal{P}_s$, or for the set of simple acts. Savage's seventh
axiom then implies that the utility function $u$ on $X$ is bounded and that the
expected-utility representation $f \prec g \Leftrightarrow E[u(f(s)), P^*] < E[u(g(s)), P^*]$, or
equivalently $f \prec g \Leftrightarrow E(u, P_f) < E(u, P_g)$, holds for all acts.

Savage’s  book  (1954)  contains  an  excellent  section  on  “Historical  and 
critical  comments  on  utility”  (pp.  91-104)  that  should  be  studied  by  everyone 
interested  in  utility. 


INDEX  TO  EXERCISES 

1-2. Probability axioms for finite sets. 3. $A/B$. 4. C1. 5. Qualitative probability
implications. 6. F5. 7-9. Uniform partitions, almost agreeing measures. 10-14. Fine and tight
qualitative probabilities. 15. $\prec$ given $A$. 16. Failure of P2. 17. $A$ is null $\Leftrightarrow A \sim^* \emptyset$.
18. Discrete measures in $\mathcal{P}$. 19. Conditional probability. 20. P1-P6 hold, P7 fails. 21. A
variant of P7. 22. P1-P7 do not imply: $P^*\{f(s) \prec g(s)\} = 1 \Rightarrow f \prec g$.


Exercises 

1. Kraft, Pratt, and Seidenberg (1959), Scott (1964). Use the Theorem of The
Alternative (Theorem 4.2) to prove the following theorem. Suppose that $S$ is finite.
Then a binary relation $\prec^*$ on the set of all subsets of $S$ satisfies (14.2) for some
probability measure $P^*$ if and only if, for all $s \in S$, all $A_1, \ldots, B_m \subseteq S$ and all $m \geq 2$:

1. not $\{s\} \prec^* \emptyset$,

2. $\emptyset \prec^* S$,

3. ($\sum_{j=1}^{m} A_j = \sum_{j=1}^{m} B_j$, $A_j \prec^* B_j$ or $A_j \sim^* B_j$ for each $j < m$) $\Rightarrow$ not $A_m \prec^* B_m$.




In (3), $A \sim^* B \Leftrightarrow$ (not $A \prec^* B$, not $B \prec^* A$), and $\sum A_j = \sum B_j \Leftrightarrow$ for
each $s$, the number of $A_j$ that contain $s$ equals the number of $B_j$ that contain $s$.

2. (Continuation.) Kraft, Pratt, and Seidenberg (1959). Let $S = \{p, q, r, s, t\}$
and denote a subset of $S$ such as $\{p, q, t\}$ by $pqt$. Let $\prec^*$ on the set of all events be
given by

$\emptyset \prec^* p \prec^* q \prec^* r \prec^* pq \prec^* pr \prec^* s \prec^* ps$
$\prec^* qr \prec^* t \prec^* pqr \prec^* qs \prec^* rs \prec^* pt \prec^* pqs \prec^* qt$
$\prec^* prs \prec^* rt \prec^* qrs \prec^* pqt \prec^* prt \prec^* st \prec^* pqrs \prec^* pst$
$\prec^* qrt \prec^* pqrt \prec^* qst \prec^* rst \prec^* pqst \prec^* prst \prec^* qrst \prec^* pqrst$,

in which the order of the last two rows is the order of the complements of the first
two rows in reverse. Clearly, F1, F2, and F3 of Theorem 14.2 hold.

a. Show that F4 holds.

b. Show that condition (3) in Exercise 1 fails.

3. For any A, B ⊆ S verify that

a. (A/B) ∩ (A ∩ B) = ∅ and (A/B) ∪ (A ∩ B) = A,

b. A ∪ (B/A) = B ∪ (A/B) = A ∪ B,

c. (A/B) ∩ (B/A) = ∅,

d. (A/B) ∪ (B/A) = (A ∪ B)/(A ∩ B),

e. (A/B) ∪ (B/A) ∪ (A ∩ B) = A ∪ B,

f. (A/B) ∪ (B/C) = (A/C) ∪ ((A ∩ C)/B) ∪ (B/(A ∪ C)), with A/C, (A ∩ C)/B,
and B/(A ∪ C) mutually disjoint.

4. Prove C1 of Section 14.2.

5. Let ≺* satisfy F1-F4. Verify

a. A ≺* B ⇔ A/B ≺* B/A,

b. A ≺* B ⇔ B^c ≺* A^c,

c.  (A  <*  A',  B  <*  F«)  =>  A  <*  B°, 

d.  S  <  *  B=>  B  S  and  A  n  B  A, 

e. (A ~* B, C ~* D, A ∪ B ~* C ∪ D, A ∩ B = C ∩ D = ∅) ⇒ A ~* C.

6.  Without  using  C5-C9,  prove  that  (FI  -F5, 0  <*A,0  <*  B)=>  0  C  <* 
A  for  some  Csfi. 

7. (Continuation.) Let F6 be: If ∅ ≺* A then A can be partitioned into B and C
with B ~* C, as in C8. On examining the proof of (14.2) through step 6, argue that
F1-F4 and F6 imply that there is a unique probability measure P* on the set of events
that satisfies

A ≺* B ⇒ P*(A) ≤ P*(B), for all A, B ⊆ S. (14.16)

8. (Continuation.) Show that the proof of (14.3) holds for the situation of the
preceding exercise, so that F1-F4 and F6 imply (14.3) when P* satisfies (14.16).

9. Let F7 be: For every positive integer n there is an n part u.p. of S. On examining
the proof of (14.2) through step 6, argue that F1-F4 and F7 imply that there is a
unique probability measure P* on the set of events that satisfies (14.16). Also prove
that (F1-F4, F7) ⇒ (14.3).




10. Following Savage (pp. 36-37) we define the following terms for a qualitative
probability ≺* on the events in S (that satisfies F1-F4):

≺* is fine ⇔ (∅ ≺* A ⇒ there is a finite partition of S each element of which is
not more probable than A).

≺* is tight ⇔ A ~* B whenever A ≼* B ∪ C and B ≼* A ∪ D for all C and D
that satisfy (B ∩ C = A ∩ D = ∅, ∅ ≺* C, ∅ ≺* D).

Given F1-F4, prove that ≺* is both fine and tight ⇔ F5 holds.

11. (Continuation.) Following Savage (p. 41), let S_1 = [0, 1], S_2 = [2, 3] and
let P_i be a finitely additive probability measure on the set of all subsets of S_i (i = 1, 2)
that agrees with Lebesgue measure [e.g., P_1([a, b]) = b − a when 0 ≤ a ≤ b ≤ 1]
on the Lebesgue measurable subsets of S_i. Let S = S_1 ∪ S_2 and, for any A ⊆ S
let A_1 = A ∩ S_1 and A_2 = A ∩ S_2. Define ≺* on the set of all subsets of S as
follows: A ≺* B ⇔ P_1(A_1) < P_1(B_1) or (P_1(A_1) = P_1(B_1), P_2(A_2) < P_2(B_2)).

a. Verify that ≺* is a qualitative probability. (F1-F4 hold.)

b. Prove that ≺* is not fine. [Let A = S_2 and argue that any finite partition of
S must contain a B for which A ≺* B.]

c. Prove that ≺* is tight.

d. With P*(A) = P_1(A_1) for all A ⊆ S, does P* satisfy (14.16)?
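
As a rough illustration of the lexicographic relation in this exercise (not a solution), the following Python sketch models only events that are finite unions of intervals, so ordinary interval length stands in for P_1 and P_2. The names `length`, `key`, `strictly_less_probable`, and `uniform_partition` are hypothetical helpers introduced here. The check covers one family of partitions only; part (b) still needs an argument for every finite partition.

```python
from fractions import Fraction as F

# An event is a pair (part inside S1, part inside S2); each part is a list of
# disjoint intervals (a, b). Only finite unions of intervals are modeled here.
def length(intervals):
    """Lebesgue measure of a finite union of disjoint intervals."""
    return sum((b - a for a, b in intervals), F(0))

def key(event):
    """The pair (P1(A1), P2(A2)) for an event A = (A1, A2)."""
    part1, part2 = event
    return (length(part1), length(part2))

def strictly_less_probable(a, b):
    """A <* B: P1 decides first and P2 only breaks ties.
    Python compares tuples lexicographically, which is exactly this rule."""
    return key(a) < key(b)

# A = S2 has P1(A1) = 0 and P2(A2) = 1, so the empty event is below it.
S2 = ([], [(F(2), F(3))])

def uniform_partition(n):
    """n pieces, each holding 1/n of S1 and 1/n of S2."""
    return [([(F(i, n), F(i + 1, n))], [(2 + F(i, n), 2 + F(i + 1, n))])
            for i in range(n)]

# Every piece has positive P1-measure, so every piece is strictly above S2,
# no matter how large n is.
pieces = uniform_partition(1000)
every_piece_beats_S2 = all(strictly_less_probable(S2, piece) for piece in pieces)
```

Exact fractions are used so that ties in the first coordinate are detected reliably.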

12. (Continuation.) Let S_1, S_2, P_1, P_2, and S be defined as in the preceding
exercise, let A_1 = A ∩ S_1 and A_2 = A ∩ S_2 for any A ⊆ S, and define A ≺* B ⇔
P_1(A_1) + P_2(A_2) < P_1(B_1) + P_2(B_2) or (P_1(A_1) + P_2(A_2) = P_1(B_1) + P_2(B_2),
P_1(A_1) < P_1(B_1)).

a. Show that ≺* is a qualitative probability.

b. Prove that ≺* is fine.

c. Prove that ≺* is not tight. [Let A = S_1, B = S_2 and show that if ≺* is
tight then A ~* B. But, by definition, B ≺* A.]

d. With P*(A) = ½[P_1(A_1) + P_2(A_2)], does P* satisfy (14.16)?

13. (Continuation.) Let S_1, S_2, P_1, and P_2 be as given in Exercise 11. Let S_3 =
[4, 5] and let P_3 be a finitely additive extension of Lebesgue measure on S_3. Take
S = S_1 ∪ S_2 ∪ S_3, let A_i = A ∩ S_i for i = 1, 2, 3 and any A ⊆ S, and define ≺* by
A ≺* B ⇔ P_1(A_1) < P_1(B_1) or (P_1(A_1) = P_1(B_1), P_2(A_2) + P_3(A_3) < P_2(B_2) +
P_3(B_3)) or (P_1(A_1) = P_1(B_1), P_2(A_2) + P_3(A_3) = P_2(B_2) + P_3(B_3), P_2(A_2) < P_2(B_2)).

a. Verify that ≺* is a qualitative probability.

b. Show that ≺* is not fine.

c. Show that ≺* is not tight.

d. With P*(A) = P_1(A_1), does P* satisfy (14.16)?

14.  ( Continuation .)  In  each  of  the  three  preceding  exercises  argue  that,  for  each 
positive  integer  n,  there  is  an  n  part  uniform  partition  of  S.  It  then  follows  from 
Exercise  9  that  P*  as  defined  for  each  of  the  three  preceding  exercises  is  the  only 
probability  measure  on  S  that  satisfies  (14.16).  Then  show  that,  in  each  of  the  three 
cases, there are A, B ⊆ S for which A ≺* B and P*(A) = P*(B), so that (14.2)
cannot hold.

Note:  In  the  remaining  exercises  F  is  the  set  of  all  functions  on  S  to  X. 




15. Prove that (P1, P2) ⇒ ≺ given A is a weak order.

16. Savage (correspondence). Let S = [0, 1] with P* on S an extension of
Lebesgue measure on [0, 1] so that, for example, P*([a, b]) = b − a when 0 ≤
a ≤ b ≤ 1. Let X = [0, ∞) and take u(x) = x, so that F is the set of all nonnegative
real functions on [0, 1]. Admitting the case of E[u(f(s)), P*] = E(f, P*) = ∞
(see Section 14.5), with ∞ = ∞, take f ≺ g if and only if E(f, P*) < E(g, P*).

a. Show that P2 fails in this situation, by considering four acts with f = f' = 1,
g = g' = 0 on A = [0, ½), and f = g and f' = g' on A^c with E(f, P*) = ∞
and E(f', P*) finite.

b. Verify that P1 and P3-P7 hold.

17. Prove that if P1-P5 hold then A is null ⇔ A ~* ∅.

18. Verify that (14.3) implies that all discrete probability measures on X are in
𝒫 = {P_f : f ∈ F} under (14.8).

19. Let P*_A be the conditional probability measure of P* given A when P*(A) > 0,
with P*_A(B) = P*(A ∩ B)/P*(A) for all B ⊆ S. Verify that P1-P7 imply, for all
f, g ∈ F and A ⊆ S, that f ≼ g given A ⇔ P*(A) = 0, or E[u(f(s)), P*_A] ≤
E[u(g(s)), P*_A] when P*(A) > 0.
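
To make the conditional measure P*_A(B) = P*(A ∩ B)/P*(A) concrete, here is a small Python sketch on a finite state space. This is an assumption made purely for illustration: Savage's axioms force S to be infinite, so the sketch shows only the arithmetic of the formula, not the theory. The helper names `conditional`, `expected_utility`, and `weakly_preferred_given`, and the example data, are hypothetical.

```python
from fractions import Fraction as F

def conditional(p, a):
    """P*_A as a table on A: P*_A({s}) = P*({s}) / P*(A) for s in A."""
    pa = sum(p[s] for s in a)
    if pa == 0:
        raise ValueError("P*(A) = 0: the conditional measure is undefined")
    return {s: p[s] / pa for s in a}

def expected_utility(u, act, p):
    """E[u(act(s)), p] for a measure p given as a table of point masses."""
    return sum(p[s] * u(act[s]) for s in p)

def weakly_preferred_given(u, f, g, p, a):
    """'f is not preferred to g given A', using the representation in the exercise."""
    if sum(p[s] for s in a) == 0:
        return True  # a null event makes every pair of acts indifferent given A
    pa = conditional(p, a)
    return expected_utility(u, f, pa) <= expected_utility(u, g, pa)

# Illustrative data: four states, state 4 has probability zero.
p = {1: F(1, 4), 2: F(1, 4), 3: F(1, 2), 4: F(0)}
f = {1: 0, 2: 10, 3: 100, 4: 7}
g = {1: 4, 2: 4, 3: 0, 4: 9}
u = lambda x: x            # utility taken as the identity for simplicity
A = {1, 2}

p_given_A = conditional(p, A)   # each of states 1 and 2 gets mass 1/2
```

Given A, only the consequences on states 1 and 2 matter, so the large payoff of f in state 3 plays no role in the comparison.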

20. Savage (p. 78). Let S = {1, 2, ...}, let X = [0, 1), and let P* be a diffuse
measure on S with P*(s) = 0 for all s ∈ S and P*{n + j, 2n + j, 3n + j, ...} = 1/n
for all n > 0, j ≥ 0. Define ≺ on F by f ≺ g ⇔ w(f) < w(g) where

w(f) = E(f, P*) + inf {P*{f(s) ≥ 1 − ε}: ε > 0}.

a. Prove that if {A_1, ..., A_n} is a partition of S and if I = {i: P*(A_i) > 0} then,
with P*_{A_i} as defined in Exercise 19,

w(f) = Σ_{i∈I} P*(A_i)[E(f, P*_{A_i}) + inf {P*_{A_i}{f(s) ≥ 1 − ε}: ε > 0}].

b. Verify that P1-P6 hold.

c. Show that P7 is violated by f and g where f = 0 and g = ½ on the odd integers
and f(n) = g(n) = n/(n + 1) for each even integer n.

21. (Continuation.) In Chapter 10 we saw that Axiom A4b (Section 10.4) is not
sufficient for the general expected-utility result when the measures in 𝒫 are not all
countably additive. In the context of the present chapter the correspondent of A4b
is P7b: If x ≼ g(s) given A for all s ∈ A then x ≼ g given A; if g(s) ≼ x given A for
all s ∈ A then g ≼ x given A. Verify that P7b holds for the example of the preceding
exercise.

22. Modify the example of Exercise 20 to give a case where P1-P7 hold and where
the following assertion is false: If f(s) ≺ g(s) for all s ∈ A and A is not null, then
f ≺ g given A.


ANSWERS  TO  SELECTED  EXERCISES 


2.1b. For each i ∈ {..., −2, −1, 0, 1, 2, ...} let f(0) = 1, f(i) = 2i when i > 0 and
f(i) = −2i + 1 when i < 0.
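
A quick numerical check of this map, reading it as f(0) = 1, f(i) = 2i for i > 0, and f(i) = −2i + 1 for i < 0: on the integers from −N to N it should hit each of 1, ..., 2N + 1 exactly once. The Python sketch below is only a finite sanity check under that reading, not a proof; `N` is an arbitrary illustrative bound.

```python
def f(i):
    """0 -> 1, positive i -> 2i (the evens), negative i -> -2i + 1 (the odds above 1)."""
    if i == 0:
        return 1
    return 2 * i if i > 0 else -2 * i + 1

N = 50
# Images of -N, ..., N in increasing order; a one-to-one correspondence with the
# positive integers should give 1, 2, ..., 2N + 1 with no gaps or repeats.
images = sorted(f(i) for i in range(-N, N + 1))
```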

2.4a. ~ is reflexive since ≼ is reflexive. ~ is symmetric from its definition. If x ~ y
and y ~ z then (x ≼ y, y ≼ x) and (y ≼ z, z ≼ y), which by the transitivity of ≼
yield (x ≼ z, z ≼ x).

2.5. If x ≺* x then not x ≺* x by asymmetry, a contradiction. Hence ≺* is irreflexive.
Suppose x ≺* y, y ≺* z. Then x ≺ x_1, x_1 ≺ x_2, ..., x_m ≺ y, y ≺ y_1, y_1 ≺ y_2, ...,
y_n ≺ z, so that x ≺* z.

2.7. A = {x, y, z} with x ≺ y, y ≺ z, and x ~ z.

2.9.  Define  x  and  y  as  equivalent  if  and  only  if  they  are  in  the  same  element  of  the 
partition. 

2.11. Suppose ≺ is transitive and x ~ y ⇔ I(x) ∩ I(y) ≠ ∅. Clearly, ≺ is irreflexive
since I(x) ∩ I(x) ≠ ∅. Suppose (x ≺ y, z ≺ w) so that I(x) ∩ I(y) = ∅ and
I(z) ∩ I(w) = ∅. Then either I(x) ∩ I(w) = ∅ in which case either x ≺ w or w ≺ x
(and hence z ≺ y by transitivity); or I(x) ∩ I(z) = ∅ in which case either x ≺ z
(and hence x ≺ w by transitivity) or z ≺ x (and hence z ≺ y); or
I(y) ∩ I(z) = ∅ ...; or I(y) ∩ I(w) = ∅ ... .

2.14. TE ⇒ Transitivity. Let TE hold. Suppose Transitivity fails with y ∈ F({x, y}),
z ∈ F({y, z}), and {x} = F({x, z}). Then z ∉ F({x, y, z}) by TE. If y ∈ F({x, y, z})
then z ∉ F({y, z}) by TE, a contradiction. Hence y ∉ F({x, y, z}). Therefore
{x} = F({x, y, z}). Then, by TE, {x} = F({x, y}), another contradiction. Therefore
F({x, y, z}) = ∅, which is false. Hence Transitivity must hold when TE holds.

2.15. f(x_1, x_2) = x_1 + .5(x_1 + x_2 − 2)(x_1 + x_2 − 1) gives a one-to-one correspondence.
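
Reading the formula as f(x_1, x_2) = x_1 + .5(x_1 + x_2 − 2)(x_1 + x_2 − 1) on pairs of positive integers (an assumed reading of the printed expression), it enumerates the pairs diagonal by diagonal. The Python sketch below checks on a finite triangle that no value is skipped or repeated; `pair` and the bound `M` are hypothetical names chosen for the illustration.

```python
def pair(x1, x2):
    """x1 + (x1 + x2 - 2)(x1 + x2 - 1)/2, computed in exact integer arithmetic.
    The product of two consecutive integers is even, so // loses nothing."""
    d = x1 + x2
    return x1 + (d - 2) * (d - 1) // 2

M = 40
# All pairs of positive integers with x1 + x2 <= M, listed by diagonal x1 + x2 = s.
values = sorted(pair(x1, s - x1) for s in range(2, M + 1) for x1 in range(1, s))
```

The diagonal x_1 + x_2 = s contains s − 1 pairs, and they receive the consecutive values (s − 2)(s − 1)/2 + 1 through (s − 1)s/2, which is why the whole triangle maps onto an initial segment of the positive integers.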

2.17. u(x_1, x_2) = ax_1 + x_2 with a > 1 will do.

2.21b. (x, y) ∈ (A ∪ B)' ⇔ (y, x) ∈ A or (y, x) ∈ B. (x, y) ∈ A' ∪ B' ⇔ (y, x) ∈ A or
(y, x) ∈ B. (x, y) ∈ (A ∩ B)' ⇔ (y, x) ∈ A and (y, x) ∈ B.

3.1. If the subset is countable let it be enumerated as 0.x_11 x_12 x_13 ..., 0.x_21 x_22 x_23 ...,
0.x_31 x_32 x_33 ..., .... Let x_i ≠ x_ii for all i, x_i ∈ {1, 2}. Is 0.x_1 x_2 x_3 ... in the
enumeration?

3.3. Let U(x) = (|x|, x). Then x ≺ y if and only if U(x) <L U(y) where (a, b) <L (c, d)
if and only if a < c or [a = c and b < d].
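
The order <L on pairs is the lexicographic order, which is also how Python compares tuples. A minimal sketch, with `U` taken from the answer and `precedes` a hypothetical helper name, sorts a few numbers by the induced preference:

```python
def U(x):
    """The two-dimensional 'utility' (|x|, x) from the answer."""
    return (abs(x), x)

def precedes(x, y):
    """x before y exactly when U(x) <L U(y); tuple comparison is lexicographic."""
    return U(x) < U(y)

# Smaller absolute value comes first; for equal absolute value the negative
# number comes before the positive one.
ordered = sorted([3, -1, 0, 2, -3, 1, -2], key=U)
```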

3.6c. (1, −2, 4, −3).

3.7b. 130.

3.8. Let X be the unit square with (0, 1) ≺ (1, 0). The set of all α(0, 1) +
(1 − α)(1, 0) = (1 − α, α) is the straight line segment from (0, 1) to (1, 0). Can
you draw a valid indifference curve that intersects this segment at several places?


3.12. Yes. If the x ∈ X are numbers, X is finite, and 𝒯 = {A ∩ X: A ∈ 𝒰}, then u on X
is continuous in 𝒯.

3.16. Given c ∈ (u(x), u(y)) suppose c ≠ u(z) for every z ∈ X. Let Y = {z: z ∈ X, u(z) < c},
Z = {z: z ∈ X, c < u(z)}. Y and Z are nonempty, disjoint, Y ∪ Z = X, and since
{b: b < c} and {a: c < a} are in 𝒰 and u is continuous, Y, Z ∈ 𝒯, contradicting the
connectedness of (X, 𝒯).

3.23. If X is not connected then it can be partitioned into nonempty, disjoint subsets
Y and Z that are both in {A ∩ X: A ∈ 𝒰^n}. If X is convex and y ∈ Y, z ∈ Z, then
L(y, z) = {αy + (1 − α)z: α ∈ [0, 1]} is in X and (L(y, z), {A ∩ L(y, z): A ∈ 𝒰^n})
is connected. But by X not being connected we must conclude that L(y, z) ∩ Y
and L(y, z) ∩ Z are nonempty, disjoint open sets in {A ∩ L(y, z): A ∈ 𝒰^n} that
partition L(y, z), so that L(y, z) is not connected, a contradiction.

4.2. 4! = 24. Eight are additive.

4.3. a, c, f, g.

4.4.  +  (*!*,)*  *  +  *!*,).  With  «,  6  £  1,  a  £  6  <=>  a(l  -f  a)  g  6(1  +  6). 

See Exercise 3a.

43.  Hint:  Include  in  (  )E4(  )  the  six  elements  whose  utilities  are  duplicated 
(8,9.13). 

4.12.  Yes. 

4.15.  For  Theorem  4.2  let  C  «•  («i(*n),  ux(xlt), ....  «»(*„),  fff*1),  ....  o{x*)). 

4.17.  Let  c  «  («,(*„), ....  «*<**,),  1). 

5.1a.  ^(0)  —  0,  iijO)  =*  2,  Mj(r)  **2  ~  e~T  when  r  £  0  and  it#(r)  “  er  when  r  <  0 
will  do. 

5.1b.  Assume  additive  utilities  exist,  let  a  ■»  «t(l)  —  «,(0)  >  0,  M  «*  «x(l)  —  «x(0)  >  0, 

Pi  »■  »x(l/0  —  »x(l/(l  +  0)  for  /  «■  1,2,...  and  show  that  M  >  mot  for  every 
positive  integer  m. 

5.4.  Suppose  m  *■  3,  n  ■«  —2.  Then  3*  —  2*«:e4**  +  (x  —  *)  — **= 

»  +  *  +  *-  *»*  + (*-z)»*+e*:E. 

5.12. 

5.13.  II^jSlI^  since  when  6  "C(  for  each  i.  Suppose  A  r*  0 , 

<feII15{.  For*  6  A  let  At(s >)€%  be  such  that  xt  g  At{x),  II  A{  £  A,  U  .44eII*U4. 
Also,  <n  ^,(*»  *  A  so  that  A  en*  TSf.  Thus,  n  £  II *  1S4. 

5.19.  Suppose  Ve  *IL.  Then  V  *  Uirt  4(r),  where  A(t)  is  an  open  interval  in  Re  for 
each/ er,  by  Exercise  18.  Therefore /■-*((/)  -  is  in  ^  when  /^(t)) 

is  in  T3  for  every  rg  T, 

5-23.  =  (2, 1,3, 1.5,  5,  8)  gives  u(*vxt)  <  «(yx,  yj,  utvx,  *t)  < 

«(*!.**).  and  «(*!,  yt)  <  u(xv  z^),  which  contradict  Q 1. 

6.1b.  [x1, ....  y1, . . . ,  ym  is  a  permutation  of  z1,  w1 . wm, 

(**,  yO  ■<  (**,  w*)  or  (**,  yt)  ■—  (z*,  w*)  for  all  y  <  m]  =>  not  (***,  yw)  -<  (**",  wm). 

Alternatively,  K*1.  V1).  •  •  • .  (*’*.  y")  Em  (**,  w1) . (*",  w”),  (**,  yO  -<  (*',  w*) 

or  (x*,  yt)  ~  {st,  wi)  for  all  /  <  mj  =>  not  (*",  y“)  <  (*",  wm),  and  (*,  y)  *—  (y,  *) 
for  ail  *,  y  e  X. 

63.  x  —  y<^-**~H»=>not*  —  y<**~w=>not*  —  *-<*y  —  w(by  vd.2))=> 

C*  —  s  y  —  w  or  y  —  w  ■<•  *  —  *]  the  latter  of  which  gives  *  —  *  *<•  w  —  y 




by  (6,1).  Abo, *  —  y  —  w  => not *  —  *♦>■<**  —  y=> not t  —  z  -<•  w  —  y 
by  (6.2).  Therefore  x  —  y  <—*  *  -  w=>x  —  t »— *  y  —  w.  (And  so  forth.) 

6.9.  For  negative  transitivity,  not  x  <y  =>y  —  x  x  —  x  and  not  y  •<*=>*  —  y<* 
y  —  y.  Hence  *  —  y  ^  —  *,  then  *  —  x  <  •  y  —  x,  then  s  —  *  ^  —  *. 

6,15a.  Asymmetry  of  ■<•  is  immediate  from  asymmetry  of  -<.  For  negative  transitivity, 
not  x  —  y  -<•  *  —  iv  =>  not /(*,  w)  ■</(*,  y)  =>/(*,  y)  < /(*,  tv),  and  not 
*  -  tv  •<•  5  -  t  =>  not /(*,  r)  </(j,  tv)  =>/(*,  »v)  </(*,  f  ).  Using  C3,  C2,  C3,  C2, 
and  C3  spin,  y),f(w,  *))  ~/(/(j,  w),/(y,  *))  <  /(/(*,  i),/(y,  *))  ~ 

/(/(*.  *),/(*.  *))  <  /</<*.  tv),/ (*.  *))  ~/(/(*,  #)./' (w.  *)):  by  transitivity, 

/(/(*.  y)./(tv,  *))  <  /(/(*,  0,/(tv,  *)),  whence /(j,  y)  <  /(*,  /)  by  C2,  which  says 
that  j  —  /  <  *  *  —  y,  or  not  x  —  y  ■<*  j  —  r. 

6.16.  To  show  that /(/(*,  y ),/(*,  tv))  ~/(/(*,  *),/(y,  iv))  let  a  *■  /(*,  y),  6  »  /(a,  tv), 
c  «■  /(*,  *),  d  “  /(y,  tv).  We  are  to  show  that  a  —  d  c  —  b.  The  permutation 
condition  of  2t(  holds  tor  x  —  a  a  —  y,  tv  —  6  6  —  s,  c  —  *  x  —  c, 

d  —  y  w  —  d,  b  —  d‘!  c  —  a,a  —  dl*  c  —  6.  If  ?  *  <•  then  not  a  —  d  -<*  c  —  6 
by  11,,  and  hence  c  —  b  -<•  a  —  d  by  2t#,  and  hence  c  —  a  -<•  6  —  d  by  (6.2),  which 
contradicts  b  —  d  ■<*  c  —  a.  Similarly  c  —  a  ■<*  6  —  can’t  hold.  Hence  6  — 
c  —  a,  which  by  it,  yields  a  —  c  —  b. 

7.1.  Since  (*lt  ...,*„)  ~  (»,,...,  xn,  Xj) - (*„,  x, . *„_!),  T  "<(*<)  “ 

7.4b.  *  <  (*! . y„)  <  (*j, . . .  ,*n-s,yB_i,y«)  <  •  •  •  <  y. 

73.  For  the  last  part:  A  *  {a,  6},  it  =  2,  (a,  a)  -<  (6,  a),  (a,  a)  -<  (a,  6),  (a,  6)  ■<  (6,  b), 
(b,  a)  <  {b,  b )  and  (a,  a)  ~  (*,  6). 

7.13k.  Suppose  a  0,  a  —  M/N  where  M,  N  arc  nonzero  integers.  Then 

x  ~  yo  Mx  ~  My  by  *  and  j.  Since  aA7  =  M,  x  —  y<=>  Nclx  ~  Afay,  and  by  e 
and  j,  Nax  ~  AAx yo  ax  —  «y. 

8.1. Expected net profit maximized at about x = 235000.

8.4a. v(e) = 3, v(w) = 9. (d) v = 5u + 5.

83.  a  -  .4. 

8.6d.  Show  that  (conditions  1 ,  2,  not  3)  =>  not  4.  In  violation  of  3  assume  that 

P  -<  Q  <  R  and  Q  %P  +  (1  —  a)R  for  ail  a  e  (0, 1).  Show  first  that,  for  every 
«.  pe  (0,  1),  (1  -  p)Q  +  fiR  <  (1  -  «)(1  -  P)P  +  IP  +  0  -  P)*]R  ^ 

(1  -  *)(1  -  P)Q  +  IP  +  (1  -  P)X]R.  (Note  that  Q  <  xP  +  (1  —  a )R  for  all 
a  e  (0, 1).  Why?)  Suppose  that  S  <  To  u(S )  <  u{T)  for  all  S,  Te  IT,.  Let  f(p)  - 
«((1  —  p)Q  +  PR),g(ff)  =  «((1  -  p)P  +  pR)  for  alt  P  e  (0, 1)  and  note  that  if 
P  <y  then  f(p)  <,g(y)  < /(y),  with  y  =■  P  +  (1  —  p)x.  Show  that 
sup  {f(P)-P  <  y)  <  f(y)  so  that  /  is  discontinuous  at  every  point  in  (0, 1).  This  is 
impossible  (why?)  and  therefore  such  a  «  does  not  exist.  Then  use  Theorem  3.1  to 
show  that  condition  4  is  false. 

8.7.  See  Exercise  6c. 

8.11a. For (A, C), P($30) = .27, P($70) = .63, P($80) = .03, P($120) = .07. The
theory of this chapter does not say that (B, D) will be preferred.

8.13.  No.  Yes. 

8.16a. y = $15000.

8.16b. y' = y. Given A, he would sell it for an amount with the same utility.




8.16c. $0 ~ ($40000 − x with pr. 1/2 or $0 − x with pr. 1/2). x ≐ $18000. If he paid
$18000 for A he would be taking a 50-50 gamble between net increments of $22000
and −$18000, which has a utility of about zero, which is what he started with.

8.16d.  Of  course  not.  In  the  two  situations  he  is  considering  different  amounts  of  total 
wealth.  He  would  sell  it  for  $18000  or  more. 

8.16e.  ($25000  with  pr.  1/2  or  -  $15000  with  pr.  1/2)  $15000.  w  ~  $20000. 

8.16f. With y as in part a, y ~ ($0 − r with pr. 1/4 or $40000 − r with pr. 1/2 or
$80000 − r with pr. 1/4).

8.16g. ($25000 with pr. 1/2 or −$15000 with pr. 1/2) ~ ($0 − $15000 − s with pr. 1/4
or $40000 − $15000 − s with pr. 1/2 or $80000 − $15000 − s with pr. 1/4).
s ≐ $12500.

9.1,  For  the  converse  of  £5  suppose  otP  +  (1  —  <*)R  *  olQ  +  (1  —  *)R  with  «  e  (0, 1) 
and  not  P  Q.  Then,  for  example,  P  ~  Tand  T  -<  Q.  By  B2,  aP  +  (1  —  a )R  ~ 

*T  +  (1  —  a )R.  By  B\ ,  uT  -f  (1  —  <*)R  <  <*Q  +  0  —  These  contradict 
aP  +  (1  —  <*)R  aQ  +  (1  —  a)R. 

9.3.  Suppose  P  ~  Q,  Q  ~  R.  Then  and  Q  by  B2. 

Therefore \{\P  +  \R)  +  \Q  ~$R  +  £<2  by  34.  Therefore  \P  +  \R  ~  R  by  {B\ ,  B2). 
Hence  P ~ R  by  (Bl,  B2). 

9.5.  If  P Q  but  P  3*  Q  then  by  a  $1  change  in  a  consequence  of  P  or  else  a  small 
change  in  two  probabilities  in  P  it  would  seem  possible  to  get  a  P*  that  is 
indifferent  to  Q  but  either  preferred  to  P  or  less  preferred  than  P. 

9.7.  Suppose  (z  —  y)*  >  inf  {(*  —  y)*:x  6  X)  for  all  *  £  X.  Then  there  is  a  sequence 
2| ,  *8, . , .  in  X  such  that  (zz  —  y)s  >  (*2  —  y)®  >  •  ‘ '  and 

inf  {(z„  —  y)a:«  *=1,2,...}=*  inf  {(a?  —  y)*:*e  X}.  Then  there  is  a  z  such  that  every 
open  set  in  Re“  that  contains  *  must  contain  some  Since  the  closure  of  X  is  X  it 
follows  that  z  e  X  and  that  (*  —  yf  =  inf  {(zn  —  y)%:  n  =  1,2,...},  which 
contradict  the  original  supposition. 

9.9d. {(α, α): 0 ≤ α ≤ 1}.

9.13(2). λ_i ≥ 0 for all i and Σ λ_i > 0.

10.4c.  (0, 1)  except  for  1/2, 1/3,  1/4, . . .  . 

10.5.  All  subsets  of  Re  that  contain  either  a  countable  number  of  elements  or  all  but  a 
countable  number  of  elements  in  Re. 

10.7. If sup {r + s: r ∈ R, s ∈ S} < sup R + sup S then sup {r + s: ...} < r + s for
some r ∈ R and s ∈ S, so that r + s < r + s.

10.9.  If  sup  KjPu'-i  =*  1,2,...}  <  2?  otjrtsup  {/?«:/  =  1,2,...}]  then 

5Xi  *  Pit  +  e  <  2/1 1  [sup  {Pa'i  =1,2,.. .}]  for  some  m,  some  c  >  0,  and 
t  ■*  1, 2, . . .  .  Let  Pifi  be  such  that  sup  {/?<>:/  =  1, 2, . . .}  piti  +  e/M  for 
j  **  1,2, . . . ,  m.  Let  s  be  the  largest  such  i{.  Then  sup  {Pn'.i  =  1, 2, . . .}  £ 

P'i  +  cIMforj  =  1,2, ...  ,m,  and «^sup{/Sw:/=  1,2, . ..}  £ 

JSL I  *tV«  +  <  2r-i  +  '•  Hence  <  2r-i  “A/ for  411 '  and 

hence  for  t  »  r,  which  is  impossible. 

Suppose  T£j  aq  sup  {Pti :i -  1, 2. . . .}  <  sup  {]§>!  =  1,2,...}.  Then 

sup(Pu:i  =  1, 2, ...}  +  «  <  2®  for  some k,  some  »>  0 and  all  it. 
Hence  sup {/?«:/  =  1,2, . ..}  <  *;/*« f°r some  m  and  all  n.  But 

2S£*i  “Ai  ^  2?*i  *f  SUP  =1,2,...}  and  a  contradiction  is  obtained. 




10.13,  Let  Av  At,...bc  mutually  disjoint  elements  in  A  with  A,  e  A,  Then 

2,-i  «i*V<  U£i  Af>  ~  2#  *<  2*  -  2<a<  SUP  (2"-i  p{(AH  :«  =  1,2,...}- 

SUP (L-i  *<<2"-i -  ». 2. ■  •  }  [by  Exercise 9]  =  sup {J*.,  2*“i *<P< 

10.15.  Let  . . .  and  g\,g%, ...  be  sequences  of  simple  ^-measurable  functions 
satisfying  (1)  and  (2)  of  Definition  10.1 1 ,  and  suppose  that  sup  {E(fn,  P)}  < 
sup  {E(gn,  P)}.  Then  £(/„,  P)  +  <  <  E(gm ,  P)  for  some  m,  some  <  >  0,  and  all  n. 

Let  An  =  {x  #„,(*)  £/„(*)  +  </2}  so  that  s  A2  £  *  *  *  and  P(At)  £  P(A„)  £  ■  •  • . 
Also,  JT  =  Ur.i^n  50  that  1  =  ~  sup {P(An)\n  =  1, 2, , , by 

Lemma  10.2.  Let  M  —  sup  (gm(z)  —  f„(x):x  £  X}.  ’Hien  for  n  ^  m, 

E(gm,P)  -  £(/„,  P)  -  £(*m  -fniP)  £  M[1  -  P(4,)l  +  /MW 2,  the  equality 
coming  from  Exercise  17.  As  n  gets  large,  the  right  side  of  this  approaches  c/2  and 
hence  E(gm,  P)  —  E(fn,  P)  <  *  for  some  «,  a  contradiction. 

10.1*.  £(/,  otP  +  (1  -  *)Q)  -  sup {£(/*,  «P+(1  -  a)0:«  =  1, 2, . . .}  = 

sup  {«£(/„,  P)  +  (1  -  *)£(/„,  0}  -  a  sup  {£(/„,  P)}  +  (1  -  a)  sup  {£(/*,  0} 
[Exercises  6,  8,  22a]  -  <*£(/,  P)  +  (1  -  *)E(f,  Q). 

10.21c.  Use  the  results  of  Exercises  21a  and  19. 

10.22a.  Let  a  <  fix)  £  b,  a  <  g(x)  <,  b  for  all  x.  Let  Ain  and  /„  be  defined  by  (10.9) 
and  (10.10),  and  let  Bi  n  =  (x:a  +  (/  —  l)(i  —  a)//i  <  ,y(x)  <;  a  +  i(b  —  o)/«}  and 
gn{x)  «  a  4-  (i  -  1)(A  -  a)jn  for  all  *6  Let  cirt  =  a  +  ii  -  l)(b  -  a)/n. 

£(/«.  P)  -  2?  .***.  and  £C?».  £>  -  2"  pWiJci«-  £(/».  £  £Cfn.  P)  follows 

from  P(P,n)  ^  P{Ai>n)  for  k  =  1 . «,  which  in  turn  follows  from 

P(/(*)  £*(*))'-!. 

10.25.  {x:x6Jf,y  <*-<*}  =  [{x:y^*}®u{x:x -<*}*)«. 

10,20.  For  Theorem  10.2  let  P(x)  *®  0  for  all  *,  R  —  }P  +  $1. 

10.27.  Show  that  if  9*  £  9  has  elements  weakly  ordered  by  <=  then  S*  =  JJo*  S  is  in  O. 

11.2c.  It  can  be  true  for  some  pair  P,  Qe  J  that  P  —  Q  when  P,  <tQ(  and  P\  =  Q$. 

For  some  other  P,  (2  pair,  P  «<  Q. 

11.2d.  Let  R*,  k  m  1, 2. . . . ,  m  —  1,  be  such  that  PJ  =  Qv  Pje  =  Pf ;  p£  =  Qt, 

P*e  -  P£-le  tor  k  «=  2, 3, . . . ,  m  -  1.  Then  P  <  P1,  P1  <  P*. . . . ,  P"*-1  ■<  0. 

11.2e.  Let  n  —  2,  (xj,  x2)  ~  (yx,  x2)  ~  (xlf  y2)  ■<  V*)  ~ P  for  all  P  on 

{xj.  j/j}  x  {xjj,  y2)  that  are  not  one-point  measures.  Show  that  <,  and  arc 
transitive  and  connected,  that  zl-<%xyx,xt^xyt,  but  (*1(  xt)  —  (yx,  Xj). 

11.2).  An  example  where  A  and  D  hold  but  is  not  connected:  X  =  {xt,  yx}  x  {xt,y8}, 
(®|,  *a)  <  (Vl  X2)  <  (xv  ya)  -<  (ylty2)  <P~Q  for  any  P,  0  that  are  not  one-point 
measures. 

11.2k.  With  a  <ib  there  are  P,  Se  !T  such  that  P  -<  S,  P4  =  o,  St  =  />,  and  PJ  =  5*. 

Ut  r  =  jp  r  =  i£+iP.  r~r'byD.  WithPC5,  if(2~P,  then 
+  iO  <  i-S  +  \Q  ~  +  JP,  or  r  -<  r,  a  contradiction.  P  <  G  by 

definition  of  -<4.  Hence  P  <Q  since  P  ~  Q  is  false. 

11.2®.  Let  X  =>  {xj,  yj  x  {x2,  y2},  take  P  ^  Q  o  (P j(*i)>  P 2(*g))  ^  (Gi(*i)>  Gj(*j)) 
and  on  [0, 1J  x  [0, 1]  take  (a,  (t)  <  (y,  <5)  if  and  only  if  a  <  y  or  [«  **  y  and 
at/?  <  y3J,  with  <  on  [0, 1J*  transitive  and  connected.  D  holds  since  (a,  ft)  ~~  (a,  /U). 
Moreover,  (0,0)  ~  (0, 1)  and  (1,0)  •<  (1, 1):  that  is,  (itv  yj)  <-*  (yj, x^  and 
(*i,  y»)  ■<  (*i.  **)»  from  which  it  follows  that  y2  -<2  x2  but  (yx,  y2)  ~  (yt,  *,).  P  says 




that  if  (a,  /?)  -<  (y,  &)  and  (j,  k)e  [0, 1 J*  and  te  (0, 1),  then  (/a  -f  (1  —  t)j, 
tp  +  (1  —  t)k)  ■<  (ty  +  (1  —  t)j,  tS  +  (1  —  t)k),  which  is  easily  seen  to  be  true. 

U.2o.  LetP*  rn  Q>  «  Pt  Q{,  ae(0, 1).  ThenP  ^  Q.  By  B and  C, 

otP  +  (1  -  a)J?  <  *G  +  (1  —  *)R.  Since  ttP\  +  (1  —  a)R J  «  a Q\  +  (1  -  a)RJ, 
if  *Q{  +  (1  —  a) Ri  <t  *P{  f-  (1  —  *)R{  then,  by  Exercises  2/and  21, 

&Q  +  (1  —  «)R  <*P  +  (1  —  a)R,  a  contradiction.  Since  <,  is  connected 
(Exercise  2n),  a P<  +  (1  —  <x)Rt  <<  <xQt  +  (1  —  a )R(.  For  the  latter  put  of  the 
theorem  take  P{  <f  Qt. 

11.3a.  E(f,P)  -  2xft(xf)P(x . . V  -  '2xiMmW*i  X  •  •  *  x  x 

{*<}  x  ^+i  x  •  •  •  x  Jf*)  **  ^Xtfi&dP iixd  =  E(fi,P {). 

11.9b.  The  condition  of  part  a  leads  directly  to  u(xlt . . .  ,xit  z®+1, . . . ,  x^) 

*1"  •  *  •  >  ■  i  a  *(*i»  •  •  • .  *  •  •  t  *||) 

+  «(* . . *<_!•  *<•  *?+i . *2)-  Let  «<(*<.  *<+i)  =  uoq . *?_!,  xit 

*<+i> •  •  • » *  •  *  >  *(i  . *J) for  »■*!,...,«  —  2,  and 

“a-l^n-l.O  *  «(*i,  •  •  •  .  *£_*.  *»-!.*»)• 

12.1.  P'({r:*(/)e  A)  r>  {r:s(/) G  A'})IP9(A'). 

12.6b.  [e(/,rt)  -  p(y,r2)]a  —  />*(Jt)(t/(win)  —  u(Iose)],  [v(g, ~  v(f,  sj\a  «= 
P*(ra)[«(win)  —  wflose)],  and  so  forth. 

12.9. He would rather marry Alice but should propose to Betsy. Use a = P*(s_1)[4 − 3],
2a = P*(s_2)[4 − 0], and 4a = P*(s_3)[3 − 0].

13.4.  The  simplest  example  is  ffl*  -*  {A,  B),  —  {.dj,  A2,  B),  and  A4  =*  {Av  At,  Blt  Bj. 

If  A  is  selected  from  55*  then  B  must  be  selected  from  55s,  but  A  u  B  *  5. 

If  B  is  selected  from  31*  and  A1  (or  d2)  is  selected  from  31s  then  A2  (or  A{) 
must  be  selected  from  a4.  But  iU/ijU^sS. 

13.5.  Let  B=f]A  A  with  B*Q>.  If  P*(B)  *  0,  then  P*(Be)  =*  1  so  that  BC£A, 

which  contradicts  B  A.  Hence  P*(B)  =  1.  If  B  has  more  than  one  element 

then  B  can  be  partitioned  into  C  and  D  with  P(C)  =  0  and  P{D)  *>  1 ,  D  c.  B, 
which  contradicts  B  =  f)^  A. 

13.10.  Let  u  be  such  that  «(*«,)  =  inf  {«(*):*£  X)  »  0  and  u(x*)  =  sup  {u(x):x£X}  «*  1. 
Given  Pe  X.  let  Al  n  =  {j;0  £  E(u,  P(s))  <,  l/n }  and  d*  „  =  {s:(f  —  1  )/n  < 

E(u,  P(j))  <,  i/n)  for  /  =  2 . n.  Define  P„  and  QB  in  X  by 

Pn(s)  —  [(i  -  i )/«}**  +  [(«  -  i  +  l)/njx*  for  all  s£Ain  and  Q„(s)  *  (//»)**  + 

[(n  —  for  all  s  e  /<(  n;  (  =  1 It  follows  from  B1  that 

P„  <  P  <  Qn  and  hence  that  v(Pn)  <,  v(P)  <;  v(Qn)  for  all  it,  where  v  is  as 
defined  in  the  proof  of  52.  Hence,  by  (13.9)  for  all  horse  lotteries  in 
tfo.  E[E{u,  P„(j)>,  P* )  <,  r(P)  ^  E[E(u,  Qn(s)),  P*]  for  all  n. 

13.12.  Let  Q  be  as  defined  following  (13.12).  Assume  c  —  0,  d  ■»  1  for  convenience. 

[If  c  «■  d  the  result  is  immediate.]  Let  R,  «=  Rt  on  5  with  E(u,  Rt)  «  // 4  for 
/  «  1 , 2, 3.  Since  0  £  E(u,  Q(j))  ^  1  for  all  s,  1/4  £  £(«,  |Q(j)  +  JR*)  ^  3/4 
for  all  j  6  5.  Therefore  Rt  <  JQ(j)  +  \Rt  ^  R3  for  all  re  5.  Hence,  by  B7, 

Ri  <  iQ  +  iR*  <  «s  By  (13.10)  and  (13.11),  p(Rj)  £  ^(R*)  +  ip(Q)  ^  v(RJ. 

Then  by  (13.9)  for  3^,  1/4  ^  Jp(Q)  +  1/4  £  3/4,  or  0  £  v(Q)  £  1. 

13.15a.  Given  *  >  0  let  B(e)  =»  {j:oc£(u,  P(i))  +  (1  —  «)£(«,  R(s»  ^  1  —  and 
C(«)  =  {j:j65(«),  and  E(u,  P(s))  <  1  —  e  or  E(u,  R(j))  <  1  —  «}.  Let 
<5  «s  a(l  —  a)«.  Then,  if  ie  C(«),  s  cannot  be  in  B(8)  since  *(1  —  *)  +  (1  —  «) 

<  1  —  «  and  <t  +  (1  —  a)(l  —«)<!—<.  Hence  the  only  elements  in  B( c) 




for  any  <  >  0  that  can  contribute  to  inf  {£•{<*£(<<,  P(j))  +  (1  —  e)E(u,  R(x)) 

2>  1  —  <}: e  >  0}  are  those  for  which  both  E(u,  P(j))  ^  1  —  <  and 

E(u,  R(j))  ;>  1  —  t.  As  a  consequence,  inf  {£*{a£(«,  P(j))  -f  (1  —  a )£(«,  R(j)> 

£  1  -  e):e  >  0}  -  inf  {£*({£(«,  P(s))  2:  1  -  «}  rs  {£(*,  R(j))  2;  1  -«}):«>  0}. 

13.15c.  Let  P,  Q,  R  €  JC  be  as  follows.  On  the  even  integers,  £(«,  P(r»  ■>  j/(1  +  s), 
E(u,  Q(s))  =  E(u,  :.<*))  ■  0.  On  the  odd  integers,  P(r)(J)  **  1,  and  E(u,  Q(s)) 

-  E(u,  R(s))  -  s/(l  +  i).  Then  e(P)  =  Ij  +  (J)(J)]  +  J  -  5/4,  t>(Q)  -  i  +  i  -  1. 
KjP  +  JR)  -  J(3/4)  +  i(i)  +  0  -  5/8,  and  d(JQ  +  |R)  -  J(|)  +  J($)  +  §  -  1. 
Hence  Q  <P  and  $P  +  JR  <  JQ  +  JR. 

14.2b. Consider pr ≺* s, ps ≺* qr, rs ≺* pt, and qt ≺* prs.
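
The four comparisons can be checked mechanically. The Python sketch below encodes the ordering of Exercise 14.2 (with the third row read as the reversed complements of the second, so that it starts prs, rt, qrs), tests F4 by brute force, and confirms that the left-hand and right-hand events contain each state equally often while every comparison is strict. F4 is assumed here to read: if C is disjoint from A and B, then A ≺* B ⇔ A ∪ C ≺* B ∪ C. The names `ORDER`, `less`, and `f4_holds` are hypothetical.

```python
from itertools import combinations
from collections import Counter

# The 32 events in increasing order; "0" stands for the empty event.
ORDER = """
0 p q r pq pr s ps
qr t pqr qs rs pt pqs qt
prs rt qrs pqt prt st pqrs pst
qrt pqrt qst rst pqst prst qrst pqrst
""".split()

events = [frozenset() if name == "0" else frozenset(name) for name in ORDER]
rank = {event: position for position, event in enumerate(events)}

def less(a, b):
    """a <* b in the listed order; a and b may be strings such as 'pr'."""
    return rank[frozenset(a)] < rank[frozenset(b)]

def f4_holds():
    """Brute-force check of F4 (as assumed above) over all A, B and disjoint C."""
    for a, b in combinations(events, 2):      # a is listed before b
        for c in events:
            if c & a or c & b:
                continue                       # C must be disjoint from both
            if less(a, b) != less(a | c, b | c):
                return False
    return True

lefts = ["pr", "ps", "rs", "qt"]
rights = ["s", "qr", "pt", "prs"]

# Each state occurs equally often on the two sides ...
same_counts = Counter("".join(lefts)) == Counter("".join(rights))
# ... yet every left-hand event is strictly less probable than its partner.
all_strict = all(less(a, b) for a, b in zip(lefts, rights))
```

If some P* agreed with ≺*, summing P* over the left-hand events would give strictly less than the sum over the right-hand events, although both sums equal 2P*(p) + 2P*(r) + 2P*(s) + P*(q) + P*(t). That is why no probability measure can satisfy (14.2) for this ordering.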

14.4. B ⊆ C ⇒ C = B ∪ (C/B). By (F1, F3), ∅ ≼* C/B so that, by (F3, F4),
∅ ∪ B ≼* (C/B) ∪ B, or B ≼* C. If S ≺* C then C ∪ C^c ≺* C, which by
F4 implies C^c ≺* ∅, contradicting F1.

14.5e. A ≺* C ⇒ B ≺* D ⇒ A ∪ B ≺* C ∪ D by C3(≺*), a contradiction.

14.10.  Let  £5  hold.  Then,  by  £5  directly,  -<*  is  fine.  If  A  -<*  £,  it  follows  easily 
from  £5  and  the  other  properties  of  C*  that  A  u  Z>  <*  B  for  some  Z>  for  which 
A  r»  D  =  0  and  0  •<*  D.  A  similar  result  is  obtained  iSB  <*  A.  Hence,  if  the 
“it<*auc  and  £  <  *  A  kj  D  for  all  •  •  •  ”  conditions  of  tightness  hold,  then 
neither  A  -<*  B  nor  B  <*  A,  which  requires  A  B  by  £3. 

On  the  other  hand,  suppose  £l-£4  hold  and  is  fine  and  tight.  Take 

A  -<*  B,  Suppose,  for  all  £,  £  B  for  which  Bx  ■<•  B,  B1  <  *  A.  Then  consider 
0  -<*  D  and  A  r\  D  =  0 .  By  fineness,  it  follows  that  there  is  a  Bt  £  B  such 
that  0  -<*£a  <*  D.  Then  since  0  ■<*  £a,  B/Ba  •<*  B  so  that  £/£*<*  A, 
which  along  with  B2  <  *  D  gives  B  ^  *  A  U  D  by  C3.  Tightness  then  requires 
that  A  £,  which  is  false.  Hence,  with  /<  -<•  £,  there  is  a  B1  £  B  for  which 

A  -<•  Bv  -<*  B.  Since  0  ■<*  B/B1  and  -<*  is  fine,  there  is  a  partition  {Q . Cm} 

of  £  with  C£  <*  £/£x  for  each  /.  Along  with  A  -<*  Bv  this  gives  A 
by  C3«*). 

14.11c. Given A, B let the “whenever” conditions of tightness hold. If no C satisfies
B ∩ C = ∅ and ∅ ≺* C then B ∼* S so that A ≼* B. If some C satisfies
∅ ≺* C and B ∩ C = ∅ then, for any such C, either P1(A1) < P1(B1) + P1(C1)
or P1(A1) = P1(B1) + P1(C1) and P2(A2) ≤ P2(B2) + P2(C2). If P2(C2) = 0 for
all such C2 then P2(B2) = 1 which insures P2(A2) ≤ P2(B2), and since P1(C1) > 0
can be made arbitrarily small, we get also P1(A1) ≤ P1(B1). Hence, if P2(C2) ≡ 0,
A ≼* B. If P2(C2) > 0 for some such C2, then we can take a C1 with P1(C1) = 0
and get P1(A1) ≤ P1(B1), where, if equality holds, it must then be true that
P2(A2) ≤ P2(B2). Again, A ≼* B. By a similar proof we get B ≼* A for all cases.
Hence A ∼* B.

14.13c.  Let  A  ms  (0,  J)  u  £t,  £  »,  (J,  1]  u  £^.  If  ■<*  is  tight,  then  A  £.  But 
£  *<•  ^  by  the  definition  of  -<*. 

14.17.  If  A  is  not  null  then,  with  x  -<  y  and  /  ■*  x  on  A,f  *=  y  on  Ae  and  g  «  y  on  £, 
f<g  given  A  by  £3  so  that  f  <g  by  Lemma  14.1.  But  if  A  0  then 
Therefore  0  A  implies  that  A  is  null. 

14.20b.  For  £6,  suppose  f<got  w(f)  <  »v(^).  Let  wig)  —  w(/)  ■*  d.  Take  a 
partition  {{l.n.  In, . . .},  {2,»  4- 1, 2n  +  1, . . . . . ,  {«  —  1,2«  —  1, . . .}} 
with  n  large  enough  so  that  2  <  dn,  and  use  the  answer  to  part  (a). 


REFERENCES 


The  references  given  here  are  those  cited  in  the  text.  For  a  more  complete  bibliography 
see  Fishburn  (1968)  and  the  other  surveys  cited  in  its  introduction. 


Adams,  E.  W.  (1965).  Elements  of  a  theory  of  inexact  measurement.  Philosophy  of 
Science,  32, 205-228. 

Allais, M. (1953). Le comportement de l’homme rationnel devant le risque: Critique
des postulats et axiomes de l’école Américaine. Econometrica, 21, 503-546.

Anscombe, F. J. and Aumann, R. J. (1963). A definition of subjective probability.
Annals of Mathematical Statistics, 34, 199-205.

Armstrong,  W.  E.  (1939).  The  determinateness  of  the  utility  function.  Economic 
Journal,  49, 453-467. 

- (1950). A note on the theory of consumer’s behaviour. Oxford Economic Papers,
2, 119-122.

Arrow, K. J. (1959). Rational choice functions and orderings. Economica, 26, 121-127.

- (1966). Exposition of the theory of choice under uncertainty. Synthese, 16,
253-269.

Aumann, R. J. (1962). Utility theory without the completeness axiom. Econometrica,
30, 445-462.

- (1964). Utility theory without the completeness axiom: a correction.
Econometrica, 32, 210-212.

- (1964). Subjective programming. In M. W. Shelly and G. L. Bryan (Eds.),
Human judgments and optimality. Wiley, New York.

Bernoulli, D. (1738). Specimen theoriae novae de mensura sortis. Commentarii
Academiae Scientiarum Imperialis Petropolitanae, 5, 175-192. Translated by L.
Sommer, Econometrica, 22 (1954), 23-36.

Birkhoff, G. (1948). Lattice theory. Rev. Ed. American Mathematical Society, New
York.

Blackwell,  D.  and  Girshick,  M.  A.  (1954).  Theory  of  games  and  statistical  decisions. 
Wiley,  New  York. 

Blaschke, W. (1928). Topologische Fragen der Differentialgeometrie. I. Thomsens
Sechseckgewebe. Zueinander diagonale Netze. Mathematische Zeitschrift, 28,
150-157.

Chernoff, H. (1954). Rational selection of decision functions. Econometrica, 22,
422-443.

Chipman,  J.  S.  (1960).  The  foundations  of  utility.  Econometrica,  28, 193-224. 




Coombs,  C.  H.  (1964).  A  theory  of  data.  Wiley,  New  York. 

Cramer, H. (1956). A theorem on ordered sets of probability distributions. Theory of
Probability and Its Applications, 1, 16-21.

Debreu, G. (1954). Representation of a preference ordering by a numerical function.
In R. M. Thrall, C. H. Coombs, and R. L. Davis (Eds.), Decision processes.
Wiley, New York.

- (1959). Theory of value. Wiley, New York.

- (1960). Topological methods in cardinal utility theory. In K. J. Arrow, S.
Karlin, and P. Suppes (Eds.), Mathematical methods in the social sciences, 1959.
Stanford University Press, Stanford, California.

- (1964). Continuity properties of Paretian utility. International Economic
Review, 5, 285-293.

de Finetti, B. (1937). La prévision: ses lois logiques, ses sources subjectives. Annales
de l’Institut Henri Poincaré, 7, 1-68. Translated by H. E. Kyburg in Kyburg and
Smokler (1964).

Diamond,  P.  A.  (1965).  The  evaluation  of  infinite  utility  streams.  Econometrica,  33, 
170-177. 

Dubins,  L.  E.  and  Savage,  L.  J.  (1965).  How  to  gamble  if  you  must:  inequalities  for 
stochastic  processes.  McGraw-Hill,  New  York. 

Eilenberg, S. (1941). Ordered topological spaces. American Journal of Mathematics,
63, 39-45.

Ellsberg, D. (1954). Classic and current notions of “measurable utility.” Economic
Journal, 64, 528-556.

- (1961). Risk, ambiguity, and the Savage axioms. Quarterly Journal of Economics,
75, 643-669.

Fishburn, P. C. (1964). Decision and value theory. Wiley, New York.

- (1967). Methods of estimating additive utilities. Management Science, 13,
435-453.

- (1968). Utility theory. Management Science, 14, 335-378.

- (1969). A general theory of subjective probabilities and expected utilities.
Annals of Mathematical Statistics, 40, 1419-1429.

Friedman,  M.  and  Savage,  L.  J.  (1948).  The  utility  analysis  of  choices  involving  risk. 
Journal  of  Political  Economy,  56, 279-304. 

- and - (1952). The expected-utility hypothesis and the measurability of
utility. Journal of Political Economy, 60, 463-474.

Frisch, R. (1926). Sur un problème d’économie pure. Norsk Matematisk Forenings
Skrifter, 1, 1-40.

- (1964). Dynamic utility. Econometrica, 32, 418-424.

Fuchs, L. (1963). Partially ordered algebraic systems. Addison-Wesley, Reading,
Massachusetts. Copyright: Akadémiai Kiadó, Budapest.

Galanter,  E.  (1962).  The  direct  measurement  of  utility  and  subjective  probability. 
American  Journal  of  Psychology,  75, 208-220. 

Goldman, A. J. (1956). Resolution and separation theorems for polyhedral convex
sets. In H. W. Kuhn and A. W. Tucker (Eds.), Linear inequalities and related
systems. Annals of Mathematics Study 38. Princeton University Press, Princeton,
New Jersey.




Halmos, P. R. (1950). Measure theory. Van Nostrand, New York.

Hausner, M. (1954). Multidimensional utilities. In R. M. Thrall, C. H. Coombs, and
R. L. Davis (Eds.), Decision processes. Wiley, New York.

Herstein, I. N. and Milnor, J. (1953). An axiomatic approach to measurable utility.
Econometrica, 21, 291-297.

Hölder, O. (1901). Die Axiome der Quantität und die Lehre vom Mass. Berichte
Verhand. König. Sächs. Gesell. Wiss. (Leipzig), Math. Phys. Cl., 53, 1-64.

Hurwicz, L. and Richter, M. K. (1970). Revealed preference without demand continuity
assumptions. In J. S. Chipman, L. Hurwicz, M. K. Richter, and H. Sonnenschein
(Eds.), Studies in the mathematical foundations of utility and demand theory: a
symposium at the University of Michigan, 1968. Harcourt, Brace and World, Inc.

Jensen,  N.  E.  (1967).  An  introduction  to  Bernoullian  utility  theory.  I.  Utility  functions. 
Swedish  Journal  of  Economics,  69, 163-183. 

Kannai,  Y.  (1963).  Existence  of  a  utility  in  infinite  dimensional  partially  ordered  spaces. 
Israel  Journal  of  Mathematics,  1, 229-234. 

Kelley,  J.  L.  (1955).  General  topology.  Van  Nostrand,  Princeton,  New  Jersey. 

Koopmans, T. C. (1960). Stationary ordinal utility and impatience. Econometrica, 28,
287-309.

-, Diamond, P. A. and Williamson, R. E. (1964). Stationary utility and time
perspective. Econometrica, 32, 82-100.

Kraft, C. H., Pratt, J. W. and Seidenberg, A. (1959). Intuitive probability on finite
sets. Annals of Mathematical Statistics, 30, 408-419.

Krantz, D. H. (1964). Conjoint measurement: the Luce-Tukey axiomatization and
some extensions. Journal of Mathematical Psychology, 1, 248-277.

- (1967). A survey of measurement theory. Michigan Mathematical Psychology
Program, MMPP 67-4. University of Michigan, Ann Arbor.

Kyburg, H. E. and Smokler, H. E. (Eds., 1964). Studies in subjective probability. Wiley,
New York.

Loève, M. (1960). Probability theory, 2nd Ed. Van Nostrand, Princeton, New Jersey.

Luce, R. D. (1956). Semiorders and a theory of utility discrimination. Econometrica,
24, 178-191.

- (1966). Two extensions of conjoint measurement. Journal of Mathematical
Psychology, 3, 348-370.

- and Raiffa, H. (1957). Games and decisions. Wiley, New York.

- and Suppes, P. (1965). Preference, utility, and subjective probability. In R. D.
Luce, R. R. Bush, and E. Galanter (Eds.), Handbook of Mathematical Psychology,
III. Wiley, New York.

- and Tukey, J. W. (1964). Simultaneous conjoint measurement: a new type of
fundamental measurement. Journal of Mathematical Psychology, 1, 1-27.

Marschak,  J.  (1950).  Rational  behavior,  uncertain  prospects,  and  measurable  utility. 
Econometrica,  18, 111-141. 

Newman,  P.  and  Read,  R.  (1961).  Representation  problems  for  preference  orderings. 
Journal  of  Economic  Behavior,  1, 149-169. 

Pareto, V. (1927). Manuel d’économie politique, 2nd Ed. Giard, Paris.

Pfanzagl,  J.  (1959).  A  general  theory  of  measurement:  applications  to  utility.  Naval 
Research  Logistics  Quarterly,  6, 283-294. 

Pratt, J. W., Raiffa, H. and Schlaifer, R. (1964). The foundations of decision under
uncertainty: an elementary exposition. Journal of the American Statistical Association,
59, 353-375.

Rader,  J.  T.  (1963).  The  existence  of  a  utility  function  to  represent  preferences.  Review 
of  Economic  Studies,  30, 229-232. 

Raiffa,  H.  (1961).  Risk,  ambiguity,  and  the  Savage  axioms:  comment.  Quarterly 
Journal  of  Economics,  75,  690-694. 

- and Schlaifer, R. (1961). Applied statistical decision theory. Division of Research,
Harvard Business School, Boston.

Ramsey, F. P. (1931). Truth and probability. In F. P. Ramsey, The foundations of
mathematics and other logical essays. Harcourt, Brace and Co., New York.
Reprinted in Kyburg and Smokler (1964).

Richter,  M.  K.  (1966).  Revealed  preference  theory.  Econometrica,  34, 635-645. 

Roberts,  F.  S.  (1969).  Indifference  graphs.  In  F.  Harary  (Ed.),  Proof  techniques  in 
graph  theory.  Academic  Press,  New  York. 

Savage, L. J. (1954). The foundations of statistics. Wiley, New York.

Scott, D. (1964). Measurement structures and linear inequalities. Journal of Mathematical
Psychology, 1, 233-247.

- and Suppes, P. (1958). Foundational aspects of theories of measurement.
Journal of Symbolic Logic, 23, 113-128.

Suppes, P. (1956). The role of subjective probability and utility in decision-making.
Proceedings of the Third Berkeley Symposium on Mathematical Statistics and
Probability, 1954-1955, 5, 61-73.

- and Winet, M. (1955). An axiomatization of utility based on the notion of utility
differences. Management Science, 1, 259-270.

- and Zinnes, J. L. (1963). Basic measurement theory. In R. D. Luce, R. R.
Bush, and E. Galanter (Eds.), Handbook of Mathematical Psychology, I. Wiley,
New York.

Swalm, R. O. (1966). Utility theory: insights into risk taking. Harvard Business
Review, November-December, 123-136.

Szpilrajn, E. (1930). Sur l’extension de l’ordre partiel. Fundamenta Mathematicae, 16,
386-389.

Thielman, H. P. (1953). Theory of functions of real variables. Prentice-Hall, Englewood
Cliffs, New Jersey.

Thomsen, G. (1927). Un teorema topologico sulle schiere di curve e una caratterizzazione
geometrica delle superficie isotermoasintotiche. Bollettino della Unione
Matematica Italiana, 6, 80-85.

Tucker, A. W. (1956). Dual systems of homogeneous linear relations. In H. W. Kuhn
and A. W. Tucker (Eds.), Linear inequalities and related systems. Annals of
Mathematics Study 38. Princeton University Press, Princeton, New Jersey.

Tversky, A. (1964). Finite additive structures. Michigan Mathematical Psychology
Program, MMPP 64-6. University of Michigan, Ann Arbor.

- (1967). A general theory of polynomial conjoint measurement. Journal of
Mathematical Psychology, 4, 1-20.

von  Neumann,  J.  and  Morgenstern,  O.  (1947).  Theory  of  games  and  economic  behavior, 
2nd  Ed.  Princeton  University  Press,  Princeton,  New  Jersey. 




^iwS^^assr of  ***■ 

— —  -  «pM  <™,. 

'^*S^ES2S£!S2&  ■*—  -**■ 

Winkler, R. L. (1967). The quantification of judgment: some methodological suggestions.
Journal of the American Statistical Association, 62, 1105-1120.

°rpure  '■  "•  “ 

- and Jureen, L. (1953). Demand analysis. Wiley, New York.

Yokoyama, T. (1956). Continuity conditions of preference ordering. Osaka Economic
Papers, 4, 39-45.


AUTHOR  INDEX 


Adams, E. W., 46, 47, 82, 83, 223
Allais, M., 109, 223
Anscombe, F. J., 167, 173, 223
Armstrong, W. E., 12, 80, 81, 83, 223
Arrow, K. J., 24, 167, 223
Aumann, R. J., 46, 121, 126, 127, 167,
175, 223

Bernoulli, D., 103, 130, 223
Birkhoff, G., 14, 29, 223
Blackwell, D., 103, 143, 147, 223
Blaschke, W., 63, 223

Chernoff, H., 3, 167, 223
Chipman, J. S., 25, 223
Coombs, C. H., 20, 224
Cramer, H., 103, 224

Debreu, G., 29, 36, 37, 34, 62, 65ff, 84,
88, 94, 96, 97, 224
de Finetti, B., 4, 161, 191, 224
Diamond, P. A., 89, 224, 225
Dubins, L. E., 132, 224

Eilenberg, S., 37, 224
Ellsberg, D., 82, 172, 224

Fishburn, P. C., 2, 4, 43, 104, 167, 223,
224

Friedman, M., 103, 224
Frisch, R., 81, 82, 224
Fuchs, L., 53, 224

Galanter, E., 88, 224

Girshick, M. A., 103, 145, 147, 223

Goldman, A. J., 46, 224

Halmos, P. R., 131, 225


Hausner, M., 110, 225
Herstein, I. N., 103, 110, 225
Hölder, O., 55, 225
Hurwicz, L., 39, 225

Jensen, N. E., 110, 225
Jureen, L., 37, 227

Kannai, Y., 121, 225
Kelley, J. L., 16, 225
Koopmans, T. C., 89, 96, 225
Kraft, C. H., 210, 211, 225
Krantz, D. H., 2, 54, 58, 59, 78, 225
Kyburg, H. E., 4, 225

Lcnrine,  M.,  87 
Loève, M., 131, 225

Luce, R. D., 2, 18, 20, 29, 54, 57, 58, 59,
65, 76, 87, 103, 110, 225

Marschak, J., 103, 225

Milnor, J., 103, 110, 225

Morgenstern, O., 101, 103, 161, 191, 226

Nassar, J. I., 96, 98, 227
Newman, P., 37, 225

Pareto, V., 81, 82, 225

Pfanzagl, J., 85, 86, 87, 88, 115, 118, 225

Pratt, J. W., 104, 167, 210, 211, 225

Rader, J. T., 37, 226

Raiffa, H., 2, 103, 104, 167, 172, 225, 226

Ramsey, F. P., 101, 161, 167, 191, 226

Read, R., 37, 225

Richter, M. K., 30, 39, 225, 226

Roberts, F. S., 24, 226

Rubin, H., 5




Savage, L. J., 2, 4, 5, 103, 109, 172, 138,

*  ,6S* 166‘  167» 191ff*  2W.  226 

Schlaifer, R., 2, 104, 167, 225, 226
Scott, D., 18, 22, 43, 45, 46, 32, 83, 84,
210, 226


Seidenberg, A., 210, 211, 225
Smokler, H. E., 4, 225

Suppes, P., 14, 18, 22, 29, 43, 45, 80, 84,
85, 87, 110, 225, 226
Swalm, R. O., 104, 226
Szpilrajn, E., 16, 226


Thielman, H. P., 35, 226
Thomsen, G., 65, 226
Tucker, A. W., 46, 226


Tukey, J. W., 54, 57, 65, 76, 87, 225
Tversky, A., 46, 51, 76, 226

von Neumann, J., 101, 103, 161, 191, 226

Weldon, J. C., 82, 227
Williams, A. C., 96, 98, 227
Williamson, R. E., 89, 225

Winet, M., 80, 84, 226

Winkler, R. L., 103, 227
Wold, H., 37, 39, 41, 227

Yokoyama, T., 37, 227

Zinnes, J. L., 14, 22, 84, 85, 87, 226




SUBJECT  INDEX 


Absolute preference-difference comparisons, 80

Absolute  value,  27 
Act,  big,  209 
bounded,  209 
constant,  166, 179, 193 
as  function  on  states  to  consequences,  165, 
175, 192 
little,  209 
normal,  209 

perfect  information,  169 
Act-state  pair,  167 
Additive  utility,  42, 44, 54, 91 
in  expected  utility,  148ff 
in  states  model,  168ff 
unbounded,  58 

unique up to similar positive linear transformations, 54
weighted,  93 
Algebra  of  sets,  130 
generated,  131 
Alternating  sequence,  151 
Antisymmetry,  10 
Asymmetry,  10 
Axiom  of  Choice,  17 

Binary relation, 10ff
properties  of,  10,  11, 18 
transitive  closure  of,  24 
see  also  Equivalence,  Order,  Qualitative 
probability 
Bisection  axiom,  85 
Bisymmetry  axiom,  85 
Boolean  algebra,  130 
generated,  131 
Borel algebra, 131
Borel set, 131
Boundary  point,  123 


Bounded  act,  190 
Bounded  utility,  138, 206 
Buying  price,  118f 

Cancellation  law,  51 
Cartesian  product,  25, 35, 42 
Certainty  equivalent,  117 
Choice  function,  24 
Closed  interval,  26 

Closure,  under  conditional  probabilities,  134 
convex,  122 

under  countable  convex  combinations,  132 
under  group  operation,  54 
under  mixture  set  operation,  110 
in  topological  space,  62 
Compensation  axiom,  57 
Complement,  130, 175 
relative,  195 

Complete  binary  relation,  11 
Component,  26 
Conditional  preference,  192 
Conditional  probability  measure,  133 
Cone, 123 
convex,  123 

Connected binary relation, 11
Connected  topological  space,  40, 62 
Consequence,  103, 166, 175 
Constant  act,  166, 179, 193 
Consumption  theory,  35, 82 
Continuity,  35, 64 
of  additive  utilities,  65,  72 
upper  semi-,  38 
Wold’s  axiom  for,  41 
Convergence,  nonuniform,  137 
uniform  from  above,  146 
uniform  from  below,  135 
Convex  closure,  47 

Convex  combination  of  measures,  106 




countable,  131, 132 
Convex  cone,  123 
Convex set, 41, 122
closure  of,  122 
Countable  additivity,  132 
compared  to  finite  additivity,  136f 
Cycle,  151 

Decision making, 1
under  certainty,  2 
under  uncertainty,  4 
Degree  of  preference,  80,  88 
Difference  of  sets,  27, 195 
Diffuse probability measure, 132
Discount  factor,  93 
constant,  96 

Discrete  probability  measure,  133 
Dominance  conditions,  32, 108, 137, 179, 
192 

Equivalence,  relation,  11 
*>,  15 
Em-  44 

Equivalence classes, 11
notation  for,  12 
Essential  factor,  71 
Ethically  neutral  proposition,  191 
Euclidean  space,  26, 31 
Even-chance  gamble,  86, 149, 189, 191 
Event,  null,  176, 192 
as  subset  of  states,  175 
Expected  utility,  additive,  148ff 
Archimedean  axiom  for,  109 
from  extraneous  probabilities,  175ff 
independence  axiom  for,  108 
lexicographic,  110 
maximum,  105 

for  probability  measures,  129ff 
Savage’s  theory  of,  191ff 
scaling  of,  104 
with  simple  measures,  103ff 
for  states  model,  166f 
Expected  value,  106, 135, 136 
Extension,  linear,  142 
of  strict  partial  order,  16 
Extraneous  probabilities,  167, 175ff 

Finite  additivity,  105 
Function,  bounded,  135 
linear,  118 


measurable,  135 

see also Expected utility, Utility

Gamble,  108 

buying and selling prices of, 118f
50-50 (even-chance), 86, 149, 189, 191
see also Probability measure
Gap  in  function,  36 
Greatest  lower  bound,  see  Infimum 
Group, 54f 

Homogeneous  product  set,  89, 156, 178 
Horse lottery, 175
bounded,  179 
homogeneous,  178 

Ideal  point,  20 
Impatience,  90 
Independence  among  sets,  43 
test for, 46
Indifference,  9, 12 
intransitive,  12, 15f,  81, 138, 169 
Indifference  curve,  31 
Indifference  hypersurface,  74 
Indifference  interval,  20f 
Indifference  loci,  31 
Indifference  map,  31 
Induced  probability  measure,  201 
Infimum  (inf),  28 
Intersection  of  sets,  35 
Interval,  closed,  26 
open,  26 

Interval  graph,  24 
Interval  order,  20 

Intransitive  indifference,  see  Indifference 
Inverse,  of  function,  64, 66 
of  group  element,  55 
Irreflexivity,  10 
Isoutility  contour,  31 

Least upper bound, see Supremum
Lexicographic order (<L), 25, 48
Linear  extension,  14 
Linear  function,  118 
Linear  order,  see  Strict  order 
Lottery,  116 

see  also  Gamble,  Horse  lottery,  Probability 
measure 

Marginal  consistency,  98 




Marginal  probability  measure,  148, 153 
Markup  factor,  93 
Maximal  independent  subset,  141 
Maximum  net  profit,  115 
Measurable  function,  135 
bounded,  135 
simple,  135 
Measurable  utility,  82 
Measure,  see  Probability  measure 
Midpoint  axiom,  85 
Mixture  set,  110 
Monotonicity  condition,  32 

Negative  transitivity,  10 
Nonsaturation  condition,  32 
Normal probability distribution, 79
Null  event,  176, 192 
Null  state,  169 

Open  interval,  26 
Open  set,  35 
Order,  interval,  20 
lexicographic,  25, 48 
quasi, 24, 121, 127
semi-,  20 
strict,  11 
strict  partial,  15 
weak, 11
Order  dense,  27 
Ordered  pair,  10 

Order-preserving  utility  function,  14 

Partition,  24, 175 
uniform,  194, 198 
Perfect  information  act,  169 
Persistent  preference,  90, 156 
Persistent  preference  differences,  92 
Preference,  1 
conditional,  192 
consistent judgments of, 109
degree  of,  80,  88 
impatient,  90 
marginally  consistent,  98 
persistent,  90, 92, 156 
single-peaked,  19 
stationary,  96, 156 
strict, 9 
time,  89ff 
see  also  Order 
Preference differences, 80ff
nontopological axioms for, 85
persistent,  92 
topological  axioms  for,  84 
Preference  independence,  43 
Preference-indifference,  13 
Preference  intensity,  see  Degree  of  preference 
Preorder, see Quasi order
Probability,  axioms  for,  194,  210 
extraneous,  167 
from  preference,  200 
qualitative,  see  Qualitative  probability 
subjective,  personal,  4 
Probability  distribution,  geometric,  133 
normal,  79 
Poisson,  133 
Probability  measure,  131 
conditional,  133 

convex  combination  of,  106, 131 
countably  additive,  132 
diffuse,  132 
discrete,  133 
finitely  additive,  105 
on homogeneous product set, 156
induced,  201 
marginal,  148, 153 
simple,  105 
on  states,  164, 165 
zero-one,  188 
Product,  scalar,  40, 46 
of  sets,  see  Product  set 
of  topological  spaces,  63 
Product  set,  25,  32, 42 
homogeneous,  89, 156, 178 
Product  space,  63 
Projection,  154 

Qualitative  probability,  193, 195 
fine,  212 
tight,  212 

Quasi order, 24, 121, 127

Random  variable,  135 
Reflexivity,  10 

St. Petersburg game, 130
Scalar  product,  40, 46 
Selling  price,  118f 
Semiorder,  20 

Separable  topological  space,  62 
Set,  Borel,  131 




closure  of,  62 
complement  of,  130 
convex,  41, 122 
convex  closure  of,  47 
countable,  9 
denumerable,  9 
open,  35 
uncountable,  26 
Sets,  Boolean  algebra  of,  130 
Borel  algebra  of,  131 
difference  of,  27, 195 
intersection  of,  35 
product  of,  25, 42 
σ-algebra of, 130
union  of,  27, 35 

Simple  probability  measure,  105 
Single-peaked  preferences,  19 
Solvability  assumption,  57, 58, 60, 65 
States,  as  functions  on  acts  to  consequences, 
163, 185 
null,  169 

probability  measures  on,  164, 165 
of  the  world,  164 
Stationary  preference,  96, 156 
Strength of preference, see Degree of preference

Strictly  ordered  group,  55 
Strict  order,  11 
Strict partial order, 15
extension  of,  16 
Subset,  10 

maximal  independent,  141 
Supremum  (sup),  27,  206 
Sure-thing  axioms,  138, 179, 193 
Symmetry,  10 
Szpilrajn's theorem, 16

Temporal  consistency,  96 
Theorem  of  The  Alternative,  46 
applications  of,  52, 169, 174,  210 
Time  preference,  89ff 
Topological  space,  35 
connected,  40, 62 
connected  subset  of,  62 
separable,  62 


Topology, 35
discrete, 40
product,  63 
relative  usual,  35 
usual,  35 

Trade-off  curve,  31 

Transformations  of  utility  functions,  15, 54, 
84 

Transitive  closure,  24 
Transitivity,  10 
negative,  10 

Unbounded  utility,  58, 116 
Uncountable  set,  26 
Uniform  convergence,  135, 146 
Uniform  partition,  194, 198 
Union  of  sets,  27, 35 
Uniqueness, up to a positive linear transformation, 84
up to similar positive linear transformations, 54

Upper  semicontinuity,  37 
Usual  topology,  35 
Utility,  additive,  42, 44, 54, 91 
bounded,  138, 206 
continuous,  35 

expected,  see  Expected  utility 
lexicographic,  48 
measurable,  82 
order-preserving,  14 
unbounded, 58, 116 
uniqueness  of,  see  Uniqueness 
upper  semicontinuous,  38 
weighted,  93 

Vector  addition,  31 
Vector  space,  47 

Weakly  connected,  11 
Weak  order,  11 
Weighted  additivity,  93 
Wold's  continuity  condition,  41 

Zorn’s  Lemma,  16 
applications  of,  141, 152, 188 


DOCUMENT CONTROL DATA - R&D


(Security classification of title, body of abstract and indexing annotation must be entered when the overall report is classified)


1. ORIGINATING ACTIVITY (Corporate author)

Research  Analysis  Corporation 

McLean,  Virginia  22101 

UNCLASSIFIED 

2b. GROUP

3. REPORT TITLE

UTILITY THEORY FOR DECISION MAKING

4. DESCRIPTIVE NOTES (Type of report and inclusive dates)

Final report

5. AUTHOR(S) (First name, middle initial, last name)

Peter C. Fishburn

6. REPORT DATE

June  1970 

224  107 

8a. CONTRACT OR GRANT NO.

DA: No. 44-188-ARO-1

b. PROJECT NO.

ONR: No. N00014-67-C-0434

090.101

RAC-R-105

9b. OTHER REPORT NO(S) (Any other numbers that may be assigned this report)

10. DISTRIBUTION STATEMENT

This document has been approved for public release and sale;
its distribution is unlimited.

12. SPONSORING MILITARY ACTIVITY

13. ABSTRACT


The  book  presents  a  concise  yet  mathematically  complete 
treatment  of  modern  utility  theories  that  covers  nonprobabilistic 
preference  theory,  the  von  Neumann- Morgenstern  expected-utility 
theory  and  its  extensions,  and  the  joint  axiomatization  of  utility  and 
subjective probability.


DD FORM 1473


14. KEY WORDS


decision 

utility 

value 

probability 

preference 


